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INTRODUCTION 

Background 

Several  years  ago,  we  isolated  the  cDNA  for  a  protein  that  binds  to  the  promoter  of  the  c-Myc 
proto-oncogene  and  represses  its  expression  (1).  This  cDNA  was  found  to  encode  the  human 
homologue  of  Drosophila  Cut,  a  homeodomain  protein  believed  to  determine  cell-type  specificity  in 
several  tissues.  The  human  protein  has  been  called  the  CCAAT-displacement  protein  (CDP)  or  CDP/Cut, 
and  the  gene  encoding  it,  CUTL-1  (Cut-hke  l).The  CUTLl  gene  was  mapped  to  chromosome  7,  band 
7q22,  a  chromosomal  region  that  is  frequently  rearranged  in  some  human  cancers  (2,  3).  Two  sets  of 
data  suggested  that  CUTLl  may  be  altered  in  some  breast  cancers.  First,  female  transgenic  mice 
expressing  the  Polyomavirus  (PyV)  Large  T  (LT)  antigen  under  the  control  of  the  mouse  mammary 
tumor  vims  long  terminal  repeat  (MMTV-LTR)  frequently  develop  mammary  tumors  and  uterine 
leiomyomas  (4).  The  results  of  coimmunoprecipitation  analyses  revealed  that  specific  complexes  of 
CDP/Cut  and  F^V  LT  antigen  could  be  detected  in  both  leiomyomas  and  mammary  tumors  (4).  Although 
the  functional  significance  of  this  interaction  remains  to  be  determined,  the  existence  of  complexes 
between  CDP/Cut  and  a  viral  oncoprotein  suggested  that  alterations  in  the  function  of  CDP/Cut  may  be  an 
important  event  in  the  etiology  of  breast  cancer.  In  agreement  with  this,  in  a  pilot  study,  we  identified 
LOH  at  CUTLl  in  6  of  63  informative  patients  (9.5%).  Using  polymorphic  markers  covering  the 
entirety  of  7q,  we  found  that  the  region  of  LOH  was  very  large  and  included  7q31  as  well  as  7q22  in  4  of 
these  6  cases,  while  in  the  2  remaining  cases  only  7q22  was  deleted.  These  results  suggested  that  two 
regions  of  the  long  arm  of  chromosome  are  deleted  in  a  fraction  of  breast  cancers:  7q22  and  7q31. 

Hypothesis 

In  our  proposal  of  1996,  we  hypothesized  that  the  human  Cut  gene,  CUTLl ,  may  function  as  a 
tumor  suppressor  gene  whose  deletion  and/or  mutation  represents  an  important  event  in  die  etiology  of 
some  breast  tumors. 

SpeciBc  Aims 

Our  specific  aims  were  to  use  polymorphic  markers  within  and  around  CUTLl  to  confirm 
LOH  of  7q22  in  breast  cancers,  to  determine  whether  CUTLl  is  situated  in  the  minimal  region  of  7q22 
that  is  deleted  in  some  breast  tumors,  to  verify  whether  CUTLl  is  mutated  in  these  tumors  and  to 
determine  whether  LOH  of  CUTLl  can  be  used  as  a  prognostic  marker  for  breast  tumors. 

BODY 


Task  1:  Months  1-12  Determine  the  frequency  of  LOH  of  CUTLl  in  patients  with  sporadic  breast 

cancer  using  tumor  samples  from  the  Royal  Victoria  Hospital,  the  Manitoba 
Breast  Tumor  Bank  and  Dr.  Rosette  Lidereau. 

The  results  of  this  study  were  published  in  April  1999  in  the  journal  Oncogene  (5)  (see  Appendix 
1).  Briefly,  we  found  LOH  of  CUTLl  in  12  out  of  66  (18%)  sporadic  breast  tumors  (Appendix  1,  Fig.  1 
and  2,  and  Table  2).  Among  these  12  tumors,  the  LOH  region  encompassed  the  7q31  region  in  5  cases 
only.  Thus,  our  results  defined  a  second  region  of  LOH  on  the  long  arm  of  chromosome  7.  The  smallest 
deleted  region  in  these  12  tumors  included  CUTLL  The  boundary  of  the  smallest  deleted  region  on  the 
telomeric  side  was  located  between  markers  D7S578  and  D7S515,  and  on  the  centromeric  side  between 
markers  D7S515  and  D7S666.  It  should  be  noted  that  the  last  two  markers  are  positioned  within  introns 
3  and  6  of  CUTLl,  respectively.  These  results  were  consistent  with  the  notion  that  CUTLl  is  a  tumor 
suppressor  gene.  If  CUTLl  is  not  the  tumor  suppressor  gene,  at  the  minimum  our  results  provide 
mapping  information  that  will  be  useful  for  an  eventual  positional  cloning  strategy. 
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V 


Task  2  Months  12-24  Establish  correlation  between  LOH  of  CUTLl  and  other  clinical  or 

pathological  parameters.  Initiate  retrospective  studies  to  confirm  any 
correlation. 

To  determine  whether  loss  of  genetic  material  at  7q22  may  be  associated  with  pathological 
features  of  breast  tumors,  we  analyzed  available  clinical  data  including  the  tumor  type,  grade  and  size, 
estrogen  and  progesterone  receptor  (ER  and  PR)  status,  occurrence  of  lymph  node  metastasis  and  age  of 
onset  (Appendix  1,  Table  3).  Statistical  analysis  was  performed  using  the  Mann- Whitney  test  (6).  We 
found  no  correlation  between  LOH  at  7q22  and  tumor  type  or  grade,  ER  or  PR  expression,  nodal  status, 
or  age  of  onset.  However,  there  was  a  significant  association  (p  value  =  0.0082)  between  LOH  at  7q22 
and  tumor  size  (Appendix  1 ,  Table  3).  The  average  sizes  of  breast  tumors  with  or  without  LOH  at  7q22 
were  4.99  and  2.93  cm  respectively.  The  difference  in  size  was  further  accentuated  in  the  group  of 
tumors  with  LOH  at  7q22  but  retention  of  7q31  (6.5  Vs  2.99  cm;  p  value  =  0.0074).  In  contrast,  no 
significant  correlation  was  found  between  LOH  at  7q31  and  any  clinical  feature. 


T ask  3  Months  1-12  Set  up  a  PCR-based  approach  to  measure  c-Myc  gene  copy  number  in  breast 

tumors.  Analyze  c-Myc  gene  status  in  parallel  to  CUTLl  in  all  samples. 

Two  sets  of  primers  derived  from  the  c-Myc  gene  intron  1  were  used  to  measure  c-Myc  gene  copy 
number  in  breast  tumors.  As  a  control,  a  primer  set  from  the  glyceraldehyde-phosphate  dehydrogenase 
was  included  in  the  PCR-reactions.  In  addition,  another  strategy  was  used  to  verify  whether  one  allele  of 
the  c-Myc  gene  was  amplified.  The  primers  were  chosen  so  as  to  flank  a  CA  repeat  that  is  found  at  the  3' 
end  of  c-Myc  intron  1 .  Since  CA  repeats  are  highly  polymorphic,  in  many  samples  these  primers  allow 
the  amplification  of  two  products  that  can  be  distinguished  on  the  basis  of  their  difference  in  size.  If  one 
allele  is  present  in  several  copies  in  a  tumor,  PCR-amplification  would  produce  a  band  of  greater  intensity 
as  compared  to  the  other  allele. 

As  hypothesized  in  our  proposal  of  1996,  c-Myc  gene  amplification  was  not  found  among  the  12 
tumors  with  CUTLl  LOH.  However,  in  breast  tumor  cell  lines  (see  Tasks  5  and  6),  we  did  not  find  any 
correlation  between  c-Myc  expression  and  CDP/Cut  DNA  binding  activity.  For  this  reason,  we  decided 
not  to  analyze  more  breast  tumors  for  c-Myc  amplification. 

We  did  not  find  any  tumor  with  c-Myc  amplification  in  our  panel  of  breast  tumors.  Moreover,  in 
breast  tumor  cell  lines  (see  Tasks  5  and  6),  we  did  not  find  any  correlation  between  c-Myc  expression  and 
CDP/Cut  DNA  binding  activity.  For  this  reason,  we  decided  not  to  analyze  more  breast  tumors  for  c- 
Myc  amplification. 

Task  4  Months  12-24  Compare  the  prognostic  values  of  CUTLl  LOH  and  c-Myc  amplification. 

Changes  in  either  c-Myc  or  CUTLl  will  be  related  with  various  clinical  and 
biological  parameters  including  the  histologic  types  (alveolar  or  ductal), 
tumor  grades,  lymph  node  infiltration,  labeling  index  (or  tumor  cell 
proliferation),  erbB2,  estrogen  and  progesterone  receptor  status,  disease-free 
survival  and  overall  survival  rates. 

As  stated  above  (Task  2),  we  found  no  correlation  between  LOH  of  CUTLl  and  tumor  type  or 
grade,  ER  or  PR  expression,  nodal  status,  or  age  of  onset  (Appendix  1,  Table  3).  Since  LOH  of 
CUTLl  was  found  in  breast  tumors  of  all  grades,  we  concluded  that  this  genetic  change  occurs  early  in 
the  development  of  breast  tumors.  As  a  consequence,  CUTLl  LOH  will  not  have  useful  prognostic 
value.  However,  in  the  course  of  our  study,  we  identified  a  novel  CUTLl  mRNA  that  was  found  to  be 
expressed  at  higher  level  in  two  breast  tumors  with  an  invasive  phenotype  (discussed  in  Task  7,  see  Fig. 
3).  These  findings  led  to  the  submission  of  a  Revised  Statement  of  Work  in  July  1999,  in  which  among 
other  things  I  proposed  to  test  whether  expression  of  this  novel  mRNA  may  have  useful  prognostic 
value.  The  results  are  described  under  Task  12. 

Task  5  Months  1-24  Investigate  Cut  expression  as  well  as  Cut  DNA  binding  and  repression 

activity  in  a  panel  of  cell  lines  derived  from  breast  tumors.  Steady-state 
levels  of  Cut  and  c-Myc  mRNAs  and  proteins  will  be  measured  by  Northern 
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and  Western  blot  analyses.  Evaluate  Cut  DNA  binding  aetivity  by 
electrophoretic  mobility  shift  assay  (EMSA).  Assess  Cut  repression  activity 
by  cotransfection  studies  following  RT-PCR  cloning  of  Cut  cDNAs  from 
these  cell  lines. 

CDP/Cut  expression  and  DNA  binding  activity  have  been  analyzed  in  a  panel  of  breast  tumor  cell 
lines,  nontumorigenic  immortalized  mammary  epithelial  cell  lines  and  non  immortalized  human  mammary 
epithelial  cells  (HMEC).  In  parallel,  c-Myc  mRNAs  were  measured  to  verify  whether  elevated  CDP/Cut 
DNA  binding  activity  correlates  with  lower  c-Myc  mRNA  levels.  Several  findings  were  made. 

CDP/Cut  DNA  binding  activity  varied  greatly  among  cell  lines  (Fig.  1  A,  EMSA  of  CDP/Cut  in 
human  mammary  cells;  Fig.  2A,  EMSA  of  CDP/Cut).  Unexpectedly,  there  was  no  correlation  between 
CDP/Cut  activity  and  c-Myc  m^A  levels. 

CDP/Cut  protein  expression  was  analyzed  by  Western  blot  analysis  (Fig.  IB  and  2B).  CDP/Cut 
has  an  expected  M.W.  of  160  kDa  and  migrates  in  SDS-PAGE  with  an  apparent  M.W.  of  180-200  kDa. 
In  breast  tumor  cell  lines,  a  large  number  of  bands  were  observed  with  anti-CDP/Cut  antibodies  raised 
against  various  regions  of  CDP/Cut.  Whether  all  these  bands  represent  true  CDP/Cut  proteins  or  are 
unrelated  cross-reacting  proteins  was  not  entirely  clear.  In  light  of  these  results,  we  have  decided  to 
develop  novel  antibodies  raised  against  short  regions  of  CDP/Cut  (see  Task  11).  Nevertheless,  one 
important  findings  was  made:  there  was  no  correlation  between  DNA  binding  and  the  abundance  of  the 
full  length  CDP/Cut  protein,  even  when  cell  extracts  were  treated  with  phosphatases  (Fig.  2 A  and  B). 
Additional  results  in  fact  suggested  that  a  protein  shorter  than  full  length  CDP/Cut  is  responsible  for 
CDP/Cut  DNA  binding  activity.  This  issue  will  be  discussed  further  in  the  conclusion  section,  under  the 
title  "A  Shorter  CDP/Cut  Protein  Is  Responsible  for  CDP/Cut  DNA  Binding  Activity". 

Since  the  Cdc25A  phosphatase  had  been  found  to  stimulate  CDP/Cut  DNA  binding,  we  measured 
Cdc25A  expression  by  Western  blot  analysis.  Although,  Cdc25A  was  expressed  at  high  levels  in  several 
breast  tumor  cell  lines,  we  did  not  find  a  correlation  between  Cdc25A  expression  and  CDP/Cut  DNA 
binding  activity  (Fig.  1C).  Since  overexpression  of  CDC25B  was  detected  in  32  percent  of  human 
primary  breast  cancers  tested,  it  is  possible  that  CDP/Cut  activity  in  breast  tumor  cell  lines  correlates  with 
the  latter  (7).  This  remains  to  be  verified. 

CDP/Cut  contains  four  DNA  binding  domains:  the  Cut  homeodomain  and  three  regions  called  Cut 
repeats  (8-11).  In  other  studies  performed  in  parallel  to  this  project,  we  demonstrated  that  CDP/Cut  DNA 
binding  activity  can  be  inhibited  following  phosphorylation  of  Cut  repeats  by  casein  kinase  II  (CKII)  or 
protein  kinase  C  (PKC),  or  of  the  Cut  homeodomain  most  likely  by  a  cyclin-dependent  kinase  (12-14). 
Moreover,  CDP/Cut  DNA  binding  activity  can  be  stimulated  following  dephosphorylation  by  the  Cdc25A 
phosphatase  (12).  In  practice,  inhibition  of  CDP/Cut  DNA  binding  activity  can  be  monitored  by  treating 
cellular  extracts  with  a  non-specific  phosphatase  prior  to  EMSA  .(12-14) .  As  expected,  in  most  cell 
lines,  CDP/Cut  DNA  binding  activity  was  down-modulated  by  phosphorylation.  An  example  is  shown 
in  Fig.  2A. 


Task  6  Months  1-24  Correlate  Cut  activity  with  c-Myc  expression  and  gene  copy  number  in  cell 

lines  derived  from  breast  tumors. 

c-Myc  mRNA  levels  were  investigated  in  a  panel  of  breast  tumor  eell  lines  and  compared  with 
CDP/Cut  DNA  binding  activity.  We  found  no  correlation  between  c-Myc  expression  and  CDP/Cut  DNA 
binding  activity. 

We  originally  cloned  the  human  CDP/Cut  cDNA  using  a  protein  binding  site  from  the  promoter  of 
the  c-Mye  proto-oncogene  and  we  accumulated  biochemical  evidence  to  show  that  CDP/Cut  can  repress 
c-Myc  (1).  Our  recent  studies,  however,  indicated  that  there  is  no  physiological  correlation  between 
CDP/Cut  DNA  binding  activity  and  repression  of  c-Myc  (12).  In  GO,  when  c-Myc  transcription  was  off, 
CDP/Cut  DNA  binding  activity  was  also  very  weak.  No  change  in  CDP/Cut  was  observed  when  c-Mye 
transcription  was  upregulated  as  cells  reentered  the  cell  cycle  in  early  Gl.  CDP/Cut  DNA  binding 
increased  later  at  the  Gl/S  transition,  a  time  at  which  c-Myc  transcription  was  stable.  Thus,  we  are 
compelled  to  conclude  that  in  fibroblastic  cells  and  breast  tumor  cell  lines,  CDP/Cut  does  not  contribute  to 
the  repression  of  c-Myc.  We  cannot  exclude  the  possibility  that  in  certain  differentiated  cells  CDP/Cut 
may  contribute  to  the  down-regulation  of  c-Myc,  but  the  evidence  for  this  so  far  is  lacking. 
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A  EMSA  of  CDP/Cut  in  mammary  cells 
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Fig.  1  Electrophoretic  mobility  shift  assay  (EMSA)  and  immunoblotting  analysis  of  CDP/Cut  in  human 
mammary  cell  extracts 

Cellular  extracts  were  prepared  from  human  mammary  cells  and  used  in  electrophoretic  mobility  shift  assays 
(EMSA)  using  a  CDP/Cut  consensus  binding  site  (A)  and  in  immunoblotting  using  anti-CDP/Cut  antibodies 
(B),  or  anti-Cdc25A  antibodies  (C).  Note  that  there  is  no  correlation  between  DNA  binding  and  the 
abundance  of  the  full  length  CDP/Cut  protein.  Which  of  the  bands  revealed  by  anti-CDP/Cut  antibodies  is 
responsible  for  CDP/Cut  DNA  binding  activity  cannot  be  determined  with  TF2  antibodies  (see  Fig.  2  for  the 
diagram  of  TF2  Ab). 

Nepveu,  Fig.  1 
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Fig.  2  EMSA  and  Immunoblot  analysis  of  CDP/Cut 

HeLa  cells  were  transfected  with  vectors  expressing 
either  full  length  CDP/Cut  or  a  mRNA  initiated  within  intron  20. 
Cellular  extracts  were  prepared  from  HeLa  and  breast  tumor  cell 
lines  and  used  in  electrophoretic  mobility  shift  assays  (EMSA) 
using  a  CDP/Cut  consensus  binding  site  (A)  and  in 
immunoblotting  using  anti-CDP/Cut  antibodies  (B). 

Note  that  extracts  from  transfected  cells  generated 
retarded  complexes  that  migrated  close  to  each  other.  Also, 
which  of  the  bands  revealed  by  anti-CDP/Cut  antibodies  is 
responsible  for  CDP/Cut  DNA  binding  activity  cannot  be 
determined  with  certitude  using  these  antibodies. 


Nepveu,  Fig.  2 


Task  7  Months  12-24  Investigate  expression  of  CUTLl  in  breast  tumors. 

In  tumors  with  LOH,  establish  if  the  remaining  CUTLl  allele  is  expressed. 

Using  reverse-transcriptase  polymerase  chain  amplification  (RT-PCR),  we  analyzed  CDP/Cut 
mRNA  expression  in  tumors  with  LOH  of  CUTLl  (Fig.  3).  The  results  demonstrated  that  CDP/Cut 
mRNA  are  expressed  in  breast  tumors  with  LOH  of  CUTLl.  In  addition  to  the  full  length  CDP/Cut 
mRNA  transcript,  we  detected  expression  of  a  CDP/Cut  mRNA  which  is  initiated  within  intron  20. 
Interestingly,  expression  of  this  mRNA  was  elevated  in  two  breast  tumors  with  an  invasive  phenotype 
(Fig.  3 A,  lanes  7  and  13).  In  normal  tissues,  RNase  mapping  analysis  indicated  that  this  novel  mRNA  is 
expressed  in  placenta  (Fig.  3B). 


Task  8  Months  12-36  In  breast  tumors  with  CUTLl  LOH,  determine  whether  the  remaining  allele 

is  mutated. 

Months  12-24  For  the  tumors  for  which  RNA  is  available,  clone  CUTLl  cDNAs  by  RT- 
PCR. 

Using  RT-PCR,  we  have  cloned  CUTLl  cDNAs  from  5  tumors.  DNA  sequencing  was 
performed  over  the  regions  encoding  the  CDP/Cut  functional  domains.  From  the  5'  end  (or  amino- 
terminal  end),  these  regions  include  the  coiled-coil  region.  Cut  repeat  1,  Cut  repeat  2,  Cut  repeat  3,  the 
Cut  homeodomain,  and  the  carboxy-terminal  repression  domains  (Fig.  4).  Two  strategies  were  used  for 
DNA  sequencing.  Either  PCR-sequencing  or  isolation  of  individual  cDNA  clones  followed  by  DNA 
sequencing  of  3  individual  clones  for  each  breast  tumor.  Similar  results  were  obtained  by  these  two 
strategies.  No  mutation  was  identified,  however,  a  number  of  polymorphisms  were  identified. 

Months  12-24  Determine  the  CUTLl  exon/intron  structure,  including  the  sequence  at  the  5' 
and  3'  end  of  introns.  Design  and  test  primers  for  PCR-amplification  of 
CUTLl  exons. 

We  have  completed  the  CUTLl  exon/intron  structure,  including  the  sequence  at  the  5'  and  3'  end 
of  introns.  Our  results  were  published  in  January  2000  in  the  journal  Gene  (15)  (see  Appendix  2).  In 
short,  the  gene  spans  at  least  340  Kbp  and  contains  33  exons  (Appendix  2,  Fig.  1).  Synthesis  of  five 
different  transcripts  involves  2  promoter  regions,  2  polyadenylation  sites  and  7  alternative  splicing 
events.  The  two  polyadenylation  sites  are  located  at  the  ends  of  exons  24  and  33  and  are  separated  by 
approximately  40  Kbp.  Transcription  is  initiated  in  two  genomic  regions,  giving  rise  to  alternate  first 
exons  which  are  spliced  to  a  common  exon  2.  All  transcripts  contain  exons  2  to  14,  but  differ  in  their  3' 
regions.  Exon  14  can  be  spliced  alternatively  to  the  beginning  or  the  middle  of  exon  15  (CDP/Cut 
mRNAs),  or  to  exon  25  (CASP  mRNA),  generating  transcripts  with  exons  15  to  24  or  exons  25  to  33. 
Moreover,  exon  16  can  be  spliced  out  from  the  mature  transcripts  that  contain  exons  15  to  24.  Overall,  5 
distinct  transcripts  are  generated  as  a  result  of  alternative  transcription  initiation,  splicing  and 
polyadenylation. 

Months  24-36  Sequence  analysis  of  CUTLl  genomic  sequences  and  cDNAs. 

Screening  of  mutations  in  CUTLl  genomic  sequences  and  cDNAs,  using 
dideoxy  fingerprinting  (ddF). 

Confirmation  of  mutations  by  direct  sequencing. 

Our  plan  was  first  to  analyze  the  cDNA  sequence  from  a  number  of  breast  tumors  with  CUTLl 
LOH  in  order  to  verify  whether  mutations  were  present  which  would  alter  the  coding  information.  We 
were  hoping  to  identify  a  region  of  the  molecule  that  was  a  "hotspot"  for  mutations.  We  were  going  then 
to  analyze  the  same  region  in  other  breast  tumors  this  time  using  genomic  DNA  and  performing  PCR- 
sequencing  of  the  appropriate  exons.  As  mentioned  above,  so  far  no  mutation  has  been  found.  As  a 
result,  we  have  not  performed  sequence  analysis  of  CUTLl  genomic  sequences. 
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A  RT-PCR  of  CDP/Cut  mRNAs  in  Breast  Tumors 
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Fig.  3  (A)  RT-PCR  analysis  of  CDP/Cut  mRNA  in  breast  tumors 

Total  RNA  was  prepared  from  slices  of  breast  tumors  and  used  in  reverse  transcription-polymerase  chain 
reaction  (RT-PCR)  using  primers  derived  from  exon  or  intron  sequences  as  indicated.  In  the  third  panel,  a  novel 
mRNA  that  is  initiated  within  intron  20  is  revealed  using  primers  from  intron  20  and  exon  22.  Note  that  the  large 
size  of  intron  21  excludes  that  the  amplified  fragments  originate  from  genomic  DNA  or  unspliced  transcripts. 

B  RNase  Mapping  of  CDP/Cut  mRNAs 
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Nepveu,  Fig.  3 
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Fig.  4  DNA  sequence  analysis  of  CDP/Cut  cDNA  isolated  from  breast  tumors  with  CUTLl  LOH 

CDP/Cut  cDNA  fragments  were  generated  by  RT-PCR  using  primers  Fl-Bl  and  F2-B2,  as  indicated. 
The  cDNA  fragments  were  either  cloned  into  the  pKS  plasmid  or  used  as  substrates  in  PCR-sequencing  using 
a  second  set  of  primers.  For  each  cDNA  fragment  cloned,  three  individual  clones  were  analyzed  by  DNA 
sequencing.  The  regions  that  were  sequenced  are  indicated  at  the  bottom. 


Nepveu,  Fig.  4 


Task  9  Months  12-36  Preparation  of  expression  vectors  to  test  the  functionality  of  CUTLl 

products  expressed  in  breast  tumors. 

Test  the  DNA  binding  activity  of  CUTLl  products  in  breast  tumors:  perform 
transfections  and  test  cell  extracts  in  electrophoretic  mobility  shift  assays. 

Test  repression  potential  of  CUTLl  products  in  breast  tumors:  perform 
cotransfection  studies  using  appropriate  effector  and  reporter  constructs. 

As  described  in  Task  8,  using  RT-PCR,  we  have  cloned  CUTLl  cDNAs  from  5  tumors.  Since 
the  length  of  the  coding  information  is  slightly  more  than  4500  nucleotides,  the  cDNAs  were  cloned  in 
two  pieces  of  approximately  2500  bp.  PCR  primers  were  chosen  such  that  the  5'  and  3'  fragments 
would  overlap  and  include  ^e  unique  Hindlll  site  at  position  2273.  The  full  length  coding  information 
was  reconstituted  following  ligation  of  the  5'  and  3'  fragments.  These  fragments  were  then  inserted  into 
a  mammalian  expression  plasmid.  The  expression  vectors  were  introduced  into  NIH3T3  cells  by 
transient  transfection  in  conjunction  with  reporter  constructs  (either  MEC-MElal-CAT  or  p21-luciferase. 
Protein  extracts  were  prepared  and  processed  to  measure  lucifierase  activity  and  CDP/Cut  DNA  binding 
activity  by  EMSA.  All  CDP/Cut  cDNA  clones  behaved  in  a  similar  manner  to  our  "normal"  CDP/Cut 
cDNA  isolated  from  placenta.  These  results  confirmed  and  extended  our  mutation  screening  results:  not 
only  were  no  mutation  found,  but  also  the  CDP/Cut  protein  expressed  in  breast  tumors  with  LOH  of 
CUTLl  behaved  like  the  wild  type  CDP/Cut. 

New  Tasks  Added  In  July  1999  (as  per  Annual  Report  of  1999) 

In  light  of  the  results  obtained  during  the  first  two  years  of  this  grant,  we  proposed  to  add  three  new  tasks 
to  be  accomplished  during  the  last  year. 

Task  10:  Months  24-30  Expression  vectors  for  growth  promoting/suppressing  assays. 

Prepare  vectors  expressing  CDP/Cut  proteins  of  varying  lengths  and  wifii  a 
Myc  and  an  HA  epitope  tag  at  the  amino-  and  carboxy-terminal  ends, 
respectively. 

Months  30-36  Growth  promoting/suppressing  assays  in  tissue  culture 

systems.  Perform  transfections  in  immortalized  cell  lines,  selecting  for 
G418  resistance.  Count  stable  transformants,  expand  5  clones  for  each 
construct,  verify  stability  of  the  integrated  DNA,  analyze  protein  expression 
and  activity. 

Electrophoretic  mobility  shift  assays  were  performed  using  the  universal  CDP/Cut  consensus 
binding  site,  CGATATCGAT,  and  nuclear  extracts  from  NIH3T3  cells  that  had  been  transfected  with  a 
vector  expressing  the  full  length  CDP/Cut  cDNA.  Two  main  retarded  complexes  were  observed  (Fig. 
5 A,  lane  1).  Antibodies  raised  against  different  regions  of  the  CDP/Cut  protein  were  used  in 
supershifting  experiments  in  order  to  assess  the  nature  of  the  protein(s)  responsible  for  the  retarded 
complexes.  While  the  861  antibodies  supershifted  both  retarded  complexes,  the  N-term  antibodies 
supershifted  only  the  highest  complex,  suggesting  that  the  lower  complex  may  involve  a  CDP/Cut  protein 
that  is  truncated  at  its  amino-terminal  end  (Fig.  2A,  lanes  2  and  3).  In  Western  blot  analyses  with  the  N- 
term  and  861  antibodies,  several  lower  molecular  weight  species  were  observed  in  addition  to  the  full 
length  200  kDa  CDP/Cut  protein  (Fig.  5C,  lanes  1  to  4)).  Interestingly,  the  110  kDa  protein  detected 
with  the  861  antibody  was  also  present  in  293  cells,  indicating  that  tois  smaller  protein  is  normally 
expressed  in  untransfected  human  cells  (Fig.  5C,  lanes  3-4). 

To  confirm  the  above  results,  we  prepared  a  vector,  called  MCH  (Myc-Cut-HA),  expressing  the 
full  length  CDP/Cut  protein  with  Myc  and  HA  epitope  tags  at  its  amino-  and  carboxy-termini, 
respectively.  NIH3T3  cells  were  transfected  with  the  MCH  vector  and  tested  in  EMSA  with  the  universal 
CDP/Cut  consensus  binding  site.  Again,  two  main  retarded  complexes  were  observed  (Fig.  5B,  lane  1). 
The  Myc  antibody  supershifted  only  the  highest  retarded  complex,  the  HA  antibody  supershifted  both 
complexes,  whereas  an  unrelated  antibody  had  no  effect  (Fig.  5B,  lanes  2  to  4).  These  results  indicate 
that  the  protein  responsible  for  the  lowest  retarded  complex  contains  an  HA  tag  but  not  a  Myc  tag,  again 
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Fig.  5 

Two  distinct  retarded  complexes  are  generated  by  CDP/Cut  proteins,  only  one  of  which  is  supershifted  by  antibodies  raised 
against  the  n-terminal  region  of  the  protein, 

(A)  and  (B)  NIH3T3  cells  were  transfected  with  a  vector  expressing  the  CDP/Cut  protein  (A),  or  Myc-Cut-HA,  a  CDP/Cut 
protein  with  Myc  and  HA  epitope  tags  at  its  amino-  and  carboxy-terminus,  respectively  (B).  Nuclear  extracts  were  prepared  and 
analyzed  in  EMSA  with  oligonucleotides  encoding  a  universal  CDP/Cut  consensus  binding  site  (CGATATCGAT).  The  DNA  and 
proteins  were  incubated  with  either  no  antibody  (lanes  1),  or  the  indicated  antibodies  (lanes  2  to  4).  The  positions  of  the  retarded 
complexes  are  indicated  by  the  arrows.  Note  that  the  lower  retarded  complex  can  be  supershifted  with  anti-861  (A)  and  anti-HA  (B) 
antibodies,  but  not  with  anti-N-term  (A)  not  anti-Myc  (B)  antibodies.  A  diagram  of  the  protein  is  shown  indicating  the  evolutionarily 
conserved  domains  and  the  regions  recognized  by  the  respective  antibodies  (CC,  coiled-coil;  CRl,  CR2  and  CR3:  Cut  repeats  1,  2 
and  3;  HD,  homeodomain). 

(C)  Nuclear  extracts  from  transfected  NIH3T3  cells  and  untransfected  293  cells  were  analyzed  in  Western  blots  with  CDP/Cut  N- 
term,  CDP/Cut  861,  HA  and  Myc  antibodies. 


Nepveu.,  Fig.  5 


suggesting  that  this  protein  is  truncated  at  its  amino-terminal  end.  In  Western  blot  analysis,  the  HA 
antibody  revealed  two  main  proteins:  One  with  an  apparent  M.W.  of  200  kDa  corresponding  to  the  full 
length  CDP/Cut,  and  one  of  approximately  110  kDa.  The  Myc  antibody  also  revealed  two  main  proteins: 
One  of  200  kDa  and  one  of  approximately  90  kDa.  These  results  demonstrate  that  shorter  CDP/Cut 
proteins  can  be  generated  from  the  full  length  CDP/Cut  coding  sequences,  and  that  an  amino-tmncated 
CDP/Cut  protein  binds  to  the  universal  CDP/Cut  binding  site. 

To  map  the  sequences  required  for  processing,  we  generated  a  series  of  vectors  expressing 
amino-tmncated  Myc-Cut-HA  proteins  (Fig.  6A).  The  name  of  each  vector  corresponds  to  the  first 
amino  acid  encoded  by  this  vector.  Only  the  full  length  MCH  and  the  659  vectors  produced  the  1 10  kDa 
proteins  (Fig.  6A).  The  878,  958  and  1029  vector  generated  shorter  proteins.  Thus,  the  site  of  cleavage 
is  located  between  CRl  and  CR2,  somewhere  between  amino  acids  659  and  878.  In  EMSA  with  the 
ATCGAT  sequence,  the  878  and  958  vectors  generated  retarded  complexes  that  migrated  faster  than  those 
produced  by  the  full  length  MCH  vector.  In  contrast,  659  produced  two  retarded  complexes,  one 
comigrating  with  the  MCH  complex,  and  one  more  intense  that  migrated  more  slowly.  These  results 
suggest  that  the  full  length  659  protein,  in  contrast  to  the  full  length  MCH  protein,  is  able  to  bind 
efficiently  to  the  ATCGAT  sequence.  Since  the  full  length  659  protein  is  more  abundant  than  its 
processed  form  (Fig.  6A,  lane  2),  it  gives  rise  to  the  most  abundant  retarded  complex  in  EMSA.  We 
deduce  from  these  results  that  the  ATCGAT  DNA  binding  activity  is  inhibited  in  the  context  of  the  full 
length  protein,  and  that  this  inhibition  can  be  released  following  amino-terminal  tmncation  of  the  protein. 

We  then  generated  a  series  of  MCH  vectors  with  progressive  carboxy-terminal  deletions.  Since 
the  110  kDa  protein  was  produced  from  each  constmct,  we  conclude  that  the  carboxy-terminal  domain, 
the  Cut  homeodomain  and  the  Cut  repeat  3  can  be  removed  without  affecting  processing  (Fig.  6C). 
Finally,  a  constmct  encoding  only  the  Cut  repeats  1  and  2  and  the  linker  between  them  also  generated  a 
shorter  protein  of  the  expected  M.W.  (Fig.  6D).  Altogether,  our  mapping  data  indicate  that  proteolytic 
cleavage  occurs  between  a.a.  659  and  878  and  does  not  require  domains  of  the  protein  that  are  located  far 
away  from  the  cleavage  site.  Whether  sequences  between  a.a.  659  and  878  are  sufficient  for  cleavage  is 
currently  under  investigation. 

Vectors  expressing  CDP/Cut  proteins  of  various  lengths  were  cotransfected  in  HeLa  cells  together 
with  a  plasmid  conferring  G418  resistance:  MCH,  MCH659  and  MCH878  (see  Fig.  6).  Unfortunately, 
the  selection  of  stable  transfectants  has  been  plagued  with  problems  of  contamination.  This  experiment  is 
currently  being  repeated.  The  number  of  colonies  will  be  counted  and  a  number  of  clones  will  be 
expanded  for  each  constmct  to  investigate  the  expression  and  activity  of  the  recombinant  protein. 

We  compared  the  growth  properties  of  mammary  epithelial  cells  with  weak  and  strong  CDP/Cut 
DNA  binding  activity.  Four  pairs  of  human  mammary  epithelial  cell  lines  with  various  tumorigenic 
potential  were  analyzed.  Pairwise  comparison  of  breast  tumor  cell  lines  revealed  that  those  cell  lines  with 
higher  CDP/Cut  DNA  binding  activity  proliferated  faster  and  to  a  higher  saturation  density  (Fig.  7). 

Task  11  Months  24-30  GST-CDP/Cut  fusion  proteins  for  antibody  production.  Prepare 

5  fusion  proteins  containing  different  segments  of  the  CDP/Cut  protein. 

Raise  antibodies  in  rabbits. 

Months  30-36  Characterize  CDP/Cut  antibodies  in  immunoblotting, 

immunoprecipitation  and  electrophoretic  mobility  shift  assays  using  extracts 
from  our  panel  of  breast  tumor  cell  lines. 

We  have  prepared  5  fusion  proteins  containing  different  segments  of  the  CDP/Cut  protein  (Fig. 

8).  Polyclonal  antibodies  were  raised  in  rabbits  and  characterized  first  in  immunoblotting  (Fig.  8A). 
Antibodies  861  and  1300  gave  the  best  results  and  were  used  in  immunoprecipitation  and  electrophoretic 
mobility  shift  assays  using  nuclear  extracts  from  breast  tumor  cell  lines.  An  example  is  shown  in  Fig. 

8B.  Assuming  that  every  band  detected  in  Western  blot  analysis  represent  a  CDP/Cut  isoform,  these 
experiments  suggest  a  complex  pattern  of  CDP/Cut  expression.  Additional  work  will  be  required  to 
ascertain  the  identity  of  each  protein  species,  determine  the  exact  pattern  of  CDP/Cut  expression  and 
investigate  how  this  may  be  altered  in  cancer  cells. 
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Fig.  6 

Proteolytic  cleavage  occurs  downstream  of  CRl,  between  a.a.  659  and  878.  NIH3T3  cells  were 
transfected  with  a  series  of  vectors  expressing  amino-  and  carboxy-  terminally  truncated  Myc-Cut-HA 
(MCH)  proteins.  Nuclear  extracts  were  prepared  and  analyzed  in  Western  blots  with  anti-HA  antibodies 
(A)  and  in  EMSA  with  the  ATCGAT  probe  (B)  for  amino-terminally  truncated  proteins.  For  carboxy- 
terminally  truncated  Myc-Cut-HA  (MCH),  proteins  were  analyzed  in  Western  blot  with  Myc  antibody  (C). 
Both  amino-and  Carboxy-truncated  protein  CRl +2  were  also  analyzed  in  Western  blot  with  Myc  antibody 
(D).  A  diagram  of  the  proteins  encoded  by  each  construct  is  shown  at  the  bottom. 

Note  that  only  the  full  length  MCH  and  the  659  are  processed  into  a  shorter  form.  In  EMSA,  659  generates 
two  main  retarded  complexes,  the  higher  of  which  migrating  more  slowly  than  what  is  seen  with  MCH. 
This  is  because  the  non-processed  659  is  more  abundant  than  its  processed  derivative,  and  is  capable  of 
stable  binding  to  the  CGATATCGAT  probe. 


Nepveu.,  Fig.  6 
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Growth  Curves  of  Breast  Cell  Lines  with  High  and  Low  CDP/Cut  DNA  Binding  Activity 


MCF-7  and  SK-Br3 
(non-Invasive  Adenocarcinomas) 


MCF-lOA  and  MCF-12A 
(non  Tumorigenic  Lines) 
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(Invasive  Adenocarcinomas) 
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Fig.  7  Growth  Curves  of  Breast  Cell  Lines  with  High  and  Low  CDP/Cut  DNA  Binding  Activity 

Four  pairs  of  breast  cell  lines  with  similar  tumorigenic  properties  but  different  CDP/Cut  DNA 
binding  activity  were  cultured  until  saturation.  Cells  were  counted  on  the  indicated  days.  Cellular  extracts 
were  prepared  from  exponentially  growing  cells  and  used  in  electrophoretic  mobility  shift  assays  (EMSA) 
using  a  CDP/Cut  consensus  binding  site.  Invariably,  those  cell  lines  with  higher  CDP/Cut  DNA  binding 
activity  were  less  contact  inhibited  and  reached  a  higher  saturation  density. 


Nepveu,  Fig.  7 
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Fig.  8 A  Characterization  of  antibodies  raised  against  GST-CDP/Cut  fusion  proteins. 

Polyclonal  antibodies  were  raised  against  the  following  GST-CDP/Cut  fusion  proteins:  409,  GST/HS 
CDP  a.a.  409  to  550;  861,  GST/HS  CDP  a.a.  861  to  936;  1032,  GST/HS  CDP  a.a.  1032  to  1 106;  1300:  GST/HS 
CDP  a.a.  1300  to  1402;  1430:  GST/HS  CDP  a.a.  1430  to  1504.  Antibodies  were  tested  in  Western  blot  analysis 
as  indieated. 
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Fig.  8B  Western  blot  analysis  and  electrophoretic  mobility  shift  assay  (EMSA)  of  CDP/Cut  in  SkBr3  and 
MCF-7. 

Polyclonal  antibodies  861  and  13000  were  used  to  analyze  CDP/Cut  expression  and  DNA  binding 
activity  in  SkBr3  and  MCF7  cells.  Nuclear  extracts  were  prepared  and  analyzed  in  EMSA  with  oligonucleotides 
encoding  a  CDP/Cut  consensus  binding  site  (ATCGAT).  The  DNA  and  proteins  were  ineubated  with  either  no 
antibody,  or  the  indicated  antibodies.  The  positions  of  the  retarded  eomplexes  are  indieated  by  the  arrows.  Note 
that  only  the  top  two  retarded  eomplexes,  A  and  B,  are  supershifted  with  anti-861.  In  contrast,  anti-1300  can 
supershift  the  three  retarded  complex.  In  the  Western  blots  on  the  right,  some  bands  are  revealed  by  both 
antibodies,  while  others  are  seen  with  only  one  of  the  two  antibodies.  The  exaet  nature  of  each  species  remains 
to  be  determined.  Nepveu,  Fig.  8 


Task  12  Months  24-36  Investigate  expression  of  the  intron  20-initiated  CDP/Cut 

mRNA  in  breast  tumors  to  determine  whether  this  could  be  used  as  a 
marker  for  lobular  carcinoma. 

Perform  RT-PCR  analysis  on  a  panel  of  breast  tumors 

representing  various  classes:  lobular  carcinoma,  well  differentiated  ductal 
carcinoma,  moderately  differentiated  ductal  carcinoma,  and  poorly 
differentiated  ductal  carcinoma. 

Our  preliminary  data  indicated  that  the  intron  20-initiated  CDP/Cut  mRNA  was  expressed  at 
higher  level  in  two  breast  tumors:  an  invasive  lobular  carcinoma  (C8961B),  and  an  invasive  ductal 
carcinoma,  but  with  a  very  diffuse  growth  pattern  (A168)  (Fig.  9,  last  two  lanes).  We  proposed  to 
analyze  the  expression  of  full  length  and  intron  20-initiated  CDP/Cut  mRNAs  in  a  panel  of  breast  tumors 
to  verify  whether  intron  20-initiated  CDP/Cut  mRNA  was  expressed  preferentially  in  breast  tumors  with 
lobular  characteristics.  Reverse-transcriptase  polymerase  chain  reaction  (RT-PCR)  was  performed  on  30 
breast  tumors  divided  in  three  classes: 

-  invasive  lobular  carcinomas  (ILC) 

-  invasive  ductal  and  lobular  mixed  carcinomas  (IDLC) 

-  invasive  ductal  carcinomas  (IDC). 

Representative  results  are  shown  in  Fig.  9.  The  expression  pattern  was  analyzed  with  regards  to 
various  clinico-pathological  parameters:  tumor  grade,  nodal  status,  estrogen  receptor  status  and 
progesterone  receptor  status.  This  analysis  did  not  reveal  any  statistically  significant  difference  between 
low  and  high  expressers  of  intron  20-initiated  CDP/Cut  mRNA. 

We  conclude  that  expression  of  intron  20-initiated  CDP/Cut  mRNA  could  not  be  used  as  a  marker 
to  predict  the  pattern  of  growth  and  invasion  of  a  given  breast  tumor,  and  thus  has  little  value  as  a 
prognostic  marker. 
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Fig.  9  RT-PCR  Analysis  of  CDP/Cut  mRNA  expression  in  breast  tumors. 

Total  mRNA  was  isolated  from  a  panel  of  32  tumors  divided  in  three  classes:  invasive 
lobular  carcinomas  (ILC);  invasive  ductal  and  lobular  mixed  carcinomas  (IDLC);  and  invasive 
ductal  carcinomas  (IDC).  RT-PCR  was  performed  with  the  indicated  primers. 


Nepveu,  Fig.  9 


KEY  RESEARCH  ACCOMPLISHMENTS 

1 .  We  have  defined  a  second  region  of  LOH  on  the  long  arm  of  chromosome  7,  within  7q22.  These 
results  were  consistent  with  the  notion  that  CUTLl  is  a  tumor  suppressor  gene.  If  CUTLl  is  not 
the  tumor  suppressor  gene,  at  the  minimum  our  results  provide  mapping  information  that  will  be 
useful  for  an  eventual  positional  cloning  strategy. 

2 .  We  found  no  correlation  between  LOH  at  7q22  and  tumor  type  or  grade,  ER  or  PR  expression, 
nodal  status,  or  age  of  onset.  However,  there  was  a  significant  association  (p  value  =  0.0082) 
between  LOH  at  7q22  and  tumor  size.  The  average  sizes  of  breast  tumors  with  or  without  LOH  at 
7q22  were  4.99  and  2.93  cm,  respectively. 

3 .  We  demonstrated  that  CDP/Cut  mRNA  was  expressed  in  breast  tumors  with  LOH  of  CUTLl .  In  5 
breast  tumors  with  LOH  of  CUTLl ,  the  other  allele  was  found  to  be  expressed.  When  CDP/Cut 
cDNAs  were  obtained  from  these  tumors  and  expressed  in  NIH3T3,  the  CDP/Cut  proteins  encoded 
by  these  cDNAs  appeared  to  be  functional  in  EMSA  and  reporter  assays:  they  bound  DNA  and 
repress  transcription  just  like  the  wild  type  CDP/Cut  protein. 

4.  In  breast  tumor  cell  lines,  CDP/Cut  DNA  binding  did  not  correlate  with  the  level  of  full  length 
CDP/Cut  protein,  even  when  extracts  were  treated  by  phosphatase.  CDP/Cut  DNA  binding  activity 
was  found  to  be  due  to  a  protein  that  is  shorter  than  the  full  length  CDP/Cut  protein. 

5 .  We  found  no  correlation  between  c-Myc  expression  and  CDP/Cut  DNA  binding  activity. 

6.  We  have  completed  the  CUTLl  exon/intron  stmcture,  including  the  sequence  at  the  5'  and  3'  end  of 
introns. 

7 .  We  identified  a  novel  CUTLl  mRNA,  initiated  within  intron  20,  that  was  found  to  be  expressed  at 
higher  level  in  some  breast  tumors.  However,  we  found  no  correlation  between  expression  of  this 
mRNA  and  tumor  type  or  grade,  ER  or  PR  expression,  or  nodal  status. 
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*  CONCLUSIONS 

LOH  at  7q22  Occurs  in  About  18%  of  Sporadic  Breast  Cancers 

We  have  established  that  a  fraction  (~18%)  of  spontaneous  breast  cancers  exhibit  LOH  of  7q22. 
LOH  at  7q31  had  previously  been  reported  in  breast  tumors.  Our  work  has  now  identified  a  second 
region  of  LOH  on  7q,  at  band  7q22.  The  centromeric  endpoint  of  the  deletion  was  mapped  to  the  CUTLl 
gene,  between  introns  6  and  20. 

Deletion  of  genetic  material  at  7q22  was  associated  with  increased  tumor  size,  but  not  with  tumor 
progression. 

LOH  of  CUTLl  Will  Not  Have  Much  Prognostic  Value 

We  were  hoping  that  LOH  of  7q22  could  serve  as  a  prognostic  marker  that  would  be  useful  in 
predicting  the  outcome  of  node-negative  breast  cancers.  We  tested  this  idea  and,  unfortunately,  our 
results  demonstrated  that  LOH  of  7q22  correlates  only  with  increased  tumor  size.  I  therefore  conclude 
that  LOH  of  7q22  does  not  have  much  prognostic  value. 

Another  hypothesis  we  tested  was  whether  LOH  of  CUTLl ,  or  reduced  CDP/Cut  activity, 
correlated  with  increased  c-Myc  expression.  Our  results  demonstrated  that  this  is  not  the  case. 

CUTLl  Is  Likely  Not  the  Tumor  Suppressor  Gene  on  7q22 

We  have  cloned  and  analyzed  the  CUTLl  cDNA  from  a  number  of  breast  tumors.  No  mutation 
was  found. 

Work  performed  in  parallel  to  our  LOH  studies  in  breast  tumors  has  revealed  that  CDP/Cut 
activity  is  associated  with  cellular  proliferation.  In  particular,  in  fibroblastic  cells  CDP/Cut  DNA  binding 
activity  was  found  to  be  minimal  in  GO,  while  it  was  activated  at  the  end  of  G1  and  was  maximal  during 
S  phase  (12).  One  target  of  CDP/Cut  repression  was  shown  to  be  the  gene  encoding  for  the 
p2iWAFi/ciPi/SDii  cyclin  kinase  inhibitor  which,  when  expressed  at  high  levels,  can  block  cell  cycle 
progression  in  Gl.  Moreover,  we  compared  the  growth  properties  of  mammary  epithelial  cells  with 
we^  and  strong  CDP/Cut  DNA  binding  activity.  Four  pairs  of  human  mammary  epithelial  cell  lines  with 
various  tumorigenic  potential  were  analyzed.  Pairwise  comparison  of  breast  tumor  cell  lines  revealed  that 
those  cell  lines  with  higher  CDP/Cut  DNA  binding  activity  proliferated  faster  and  to  a  higher  saturation 
density. 

The  CUTLl  Gene  Is  Large  and  Several  Mechanisms  Lead  to  the  Expression  of  Multiple 
Protein  Species 

We  have  determined  the  complete  CUTLl  exon/intron  structure,  including  the  sequence  at  the  5' 
and  3'  end  of  introns.  The  gene  spans  at  least  340  Kbp  and  contains  33  exons.  Synthesis  of  five 
different  transcripts  involves  2  promoter  regions,  2  polyadenylation  sites  and  7  alternative  splicing 
events.  In  Western  blot  analysis,  a  number  of  protein  species  are  visible  with  different  CDP/Cut 
antibodies.  Although  many  of  these  species  remain  to  be  fully  characterized,  our  work  so  far  has 
revealed  that  at  least  one  of  these  proteins  is  generated  by  proteolytic  processing  of  the  full  length 
CDP/Cut  protein,  while  another  protein  is  the  product  of  a  mRNA  initiated  within  intron  20. 

A  Shorter  CDP/Cut  Protein  Is  Responsible  for  CDP/Cut  DNA  Binding  Activity 

In  breast  tumor  cell  lines,  CDP/Cut  DNA  binding  did  not  correlate  with  the  level  of  full  length 
CDP/Cut  protein,  even  when  extracts  were  treated  by  phosphatase.  These  findings  led  us  to  investigate 
further  the  relationship  between  CDP/Cut  protein  expression  and  DNA  binding  activity.  We  have 
demonstrated  that  the  CDP/Cut  protein  can  be  proteolytically  processed  to  generate  an  amino-terminally 
truncated  protein  of  ~1 10  kDa.  This  protein  is  responsible  for  the  lower  of  the  two  retarded  complexes 
observed  using  a  probe  encoding  a  Cut  binding  site.  The  higher  retarded  complex  involves  the  full  length 
CDP/Cut  protein.  The  molecular  function  and  biological  activity  of  the  truncated  110  kDa  protein  is 
currently  being  investigated. 

Two  Alternative  Mechanisms  May  Lead  to  the  Production  of  Shorter  CDP/Cut  Proteins 

Processing  of  the  full  length  CDP/Cut  protein,  however,  is  not  the  only  mechanism  leading  to  the 
expression  of  shorter  CDP/Cut  proteins.  In  placenta,  which  is  composed  largely  of  trophoblasts 
undergoing  endomitosis,  a  novel  mRNA  was  found  to  be  initiated  within  intron  20.  This  mRNA  encodes 
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for  a  short  CDP/Cut  protein  corresponding  to  a  little  less  than  the  carboxy-terminal  half.  Using  RT-PCR, 
we  analyzed  expression  of  full  length  and  intron  20-initiated  CDP/Cut  mRNAs  in  a  panel  of  breast  tumors 
comprising  lobular  and  ductal  carcinomas.  The  expression  pattern  was  analyzed  with  regards  to  various 
clinico-pathological  parameters:  tumor  grade,  nodal  status,  estrogen  receptor  status  and  progesterone 
receptor  status.  This  analysis  did  not  reveal  any  statistically  significant  difference  between  low  and  high 
expressers  of  intron  20-initiated  CDP/Cut  mRNA. 

We  conclude  that  expression  of  intron  20-initiated  CDP/Cut  mRNA  could  not  be  used  as  a  marker 
to  predict  the  pattern  of  growth  and  invasion  of  a  given  breast  tumor,  and  thus  has  little  value  as  a 
prognostic  marker. 
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chromosome  7  in  human  hreast  cancer  defines  the  location  of  a  second 
tumor  suppressor  gene  at  7q22  in  the  region  of  the  CUTLl  gene 
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In  breast  cancer,  loss  of  heterozygosity  (LOH)  has  been 
described  on  the  long  arm  of  chromosome  7,  at  band  q31, 
suggesting  the  presence  of  a  tumor  suppressor  gene  in 
this  region.  In  this  study,  we  have  identified  a  second 
region  of  LOH  on  7q,  at  band  7q22.  Deletion  of  genetic 
material  at  7q22  was  found  in  all  tumor  types  and  grades 
and  was  associated  with  increased  tumor  size.  The  region 
of  LOH  at  7q22  in  every  case  included  one  or  more  of 
three  polymorphic  markers  that  are  located  within  the 
CUTLl  gene.  LOH  of  7q22  has  also  been  documented  in 
the  case  of  human  uterine  leiomyomas  (Zeng  et  al.,  1997; 
Ishwad  et  al.,  1997).  Interestingly,  in  both  leiomyomas 
and  mammary  tumors  induced  in  transgenic  mice 
expressing  the  Polyomavirus  (PyV)  large  T  (LT) 
antigen,  immunocomplexes  of  CUTLl  and  PyV  LT 
antigen  were  detected  (Webster  et  al.,  1998).  Altogether, 
genetic  data  in  human  breast  cancer  and  biochemical 
analyses  in  breast  tumors  from  transgenic  mice  suggest 
that  CUTLl  is  a  candidate  tumor  suppressor  gene. 

Keywords:  breast  cancer;  tumor  suppressor  gene; 
chromosome  7q22;  CUTLl;  human  Cut  homeobox 
gene;  loss  of  heterozygosity 


Introduction 

Breast  cancer  accounts  for  almost  30%  of  cancer 
diagnoses  and  almost  20%  of  cancer-related  deaths  in 
women.  It  is  the  leading  cause  of  death  among  women 
aged  40-55.  In  the  United  States,  it  affects  more  than 
one  in  ten  women  (Harris  et  al.,  1992).  Although  little 
is  known  about  the  etiology  and  pathogenesis  of  breast 
neoplasias,  alterations  of  oncogenes  and  tumor 
suppressor  genes  are  considered  to  be  critical  in  the 
multistep  process  leading  to  the  development  of  breast 
cancer. 

In  human  breast  cancer,  genetic  deletions  have  been 
demonstrated  to  be  one  of  the  major  genetic 
abnormalities.  LOH  has  been  documented  at  several 
chromosomal  locations,  including  Ip,  Iq,  2p,  3p,  6q, 
7q,  8q,  9q,  Up,  llq,  13q,  15q,  16q,  17p,  17q,  18p,  18q 
and  22q  (reviewed  by  Bieche  and  Lidereau,  1995).  A 
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few  of  these  regions  of  LOH  have  been  shown  to 
include  a  known  tumor  suppressor  gene  implicated  in 
breast  cancer,  such  as  p53  at  chromosome  17ql3  (Isobe 
et  al.,  1986;  McBride  et  al.,  1986;  Miller  et  al.,  1986), 
BRCA-l  at  17q21  (Miki  et  al.,  1994)  and  BRCA-2  at 
13ql2-13  (Wooster  et  al.,  1995).  Genome-wide  LOH 
analysis  to  generate  allelotypes  of  human  breast  cancer 
revealed  that  several  chromosomal  regions  could  be 
deleted  in  a  single  tumor  but  no  common  sets  of 
deletions  could  be  identified  between  different  tumors 
(Devilee  et  al.,  1997;  Kerangueven  et  al.,  1997;  Larsson 
et  al.,  1990;  Sato  et  al.,  1990). 

From  cytogenetic  studies,  deletions  within  the  long 
arm  of  chromosome  7  have  been  found  in  breast 
cancer  and  other  types  of  tumors,  including  myeloid 
leukemias  (reviewed  in  Fischer  et  al.,  1998),  kidney 
carcinomas,  colon  carcinomas,  ovarian  carcinomas, 
lung  carcinomas,  head-and-neck  carcinomas  (Berker- 
karauzum  et  al.,  1998;  Dave  et  al.,  1995;  Lundgren, 
1991;  Lundgren  et  al.,  1992;  Mertens  et  al.,  1997; 
Solinas-Toldo  et  al.,  1996;  Storto  et  al.,  1990)  and 
uterine  leiomyomas  (Ozisik  et  al.,  1993;  Sargent  et  al., 
1994;  Xing  et  al.,  1997).  LOH  analyses  confirmed  that 
genetic  material  on  7q  is  frequently  deleted  in  breast 
cancer,  particularly  in  the  7q31  region  (Bieche  et  al., 
1997;  Callahan  et  al.,  1993;  Champeme  et  al.,  1995; 
Deng  et  al.,  1994;  Devilee  et  al.,  1997;  Kristjansson  et 
al.,  1997;  Lin  et  al.,  1996;  Tougas  et  al.,  1996). 
Altogether  these  data  have  been  taken  to  suggest  that 
a  tumor  suppressor  gene  is  located  in  this  region, 
however,  this  gene  remains  to  be  identified. 

In  previous  studies,  the  loss  of  heterozygosity  (LOH) 
at  7q22  has  been  reported  in  a  subset  of  uterine 
leiomyomas  (Ishwad  et  al.,  1997;  Zeng  et  al.,  1997), 
which  are  benign  tumors  of  smooth  muscle  origin  most 
often  referred  to  as  fibroids.  The  smallest  commonly 
deleted  region  in  leiomyomas  included  markers  that  are 
located  within  CUTLl  (Cut-Hke-l),  a  gene  that  is 
homologous  to  the  Drosophila  melanogaster  cut  gene. 
Genetic  studies  in  Drosophila  melanogaster  indicated 
that  cut  is  involved  in  determination  and  maintenance 
of  cell  identity  in  several  tissues.  CUTLl  encodes  a 
homeodomain  transcription  factor  that  functions  as  a 
transcriptional  repressor.  In  humans,  the  Cut  protein 
was  first  identified  as  the  CCAAT  displacement  protein 
(CDP),  a  repressor  of  the  gp-91  phox  gene  in 
undifferentiated  myeloid  cells  (Skalnik  et  al.,  1991).  A 
CUTLl  cDNA  was  also  isolated  by  screening  a  cDNA 
expression  library  within  a  probe  derived  from  the 
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c-Myc  gene  promoter  (Dufort  and  Nepveu,  1994).  The 
human  Cut  protein  was  shown  to  bind  to  the  c-Myc 
promoter  and  repress  its  expression  (Harada  et  al., 
1994;  Mailly  et  al.,  1996).  Other  cwt-related  cDNAs 
have  been  isolated  from  several  mammalian  species 
including  dog,  mouse  and  rat  and  were  respectively 
termed  Clox  (Cut-Hke  homeoboxi  Cux-1  and  Cux-2 
(Cut  homeobox)  and  CDP-2  (Andres  et  al.,  1992; 
Dufort  and  Nepveu,  1994;  Neufeld  et  al.,  1992; 
Quaggin  et  al.,  1996;  Valarche  et  al.,  1993;  Yoon  and 
Chikaraishi,  1994). 

Female  transgenic  mice  expressing  the  Polyomavirus 
(PyV)  Large  T  (LT)  antigen  under  the  control  of  the 
mouse  mammary  tumor  virus  long  terminal  repeat 
(MMTV-LTR)  frequently  develop,  in  addition  to 
mammary  tumors,  uterine  leiomyomas  (Webster  et 
al.,  1998).  Since  we  had  identified  LOH  at  CUTLl  in 
human  uterine  leiomyomas,  we  examined  whether  PyV 
LT  antigen  formed  specific  complexes  not  only  with 
members  of  the  retinoblastoma  (Rb)  family  (plOSRb, 
pl07,  pl30),  but  also  with  the  mammalian  Cut  protein. 
The  results  of  coimmunoprecipitation  analyses  revealed 
that  specific  complexes  of  Cut  and  PyV  LT  antigen 
could  be  detected  in  both  leiomyomas  and  mammary 
tumors.  The  existence  of  such  complexes  suggested  that 
the  alterations  in  the  function  of  the  Cut  protein  may 
be  an  important  event  in  the  etiology  of  breast  cancer. 
These  results  lead  us  to  hypothesize  that  genetic 
alterations  within  the  7q22  region  may  also  occur  in 
human  breast  tumors.  We  report  here  that  LOH  at 
7q22  is  observed  in  a  subset  of  sporadic  breast  cancers 
and  is  associated  with  increased  tumor  size. 


Results 

A  subset  of  breast  tumors  exhibits  LOH  of  7q22 
andjor  7q3t 

The  present  study  was  initiated  following  the  observation 
that  genetic  material  at  7q22  is  deleted  in  a  fraction  of 
uterine  leiomyomas.  The  smallest  deleted  region  at  7q22 
included  three  polymorphic  markers  which  since  then 
have  been  located  within  the  CUTLl  gene  (Figure  2) 
(Ishwad  et  al.,  1997;  Zeng  et  al.  1997;  and  Zeng  et  al., 
manuscript  in  preparation).  Secondly,  the  product  of  the 
CUTLl  gene  was  found  to  associate  with  the  PyV  LT 
oncoprotein  both  in  uterine  leiomyomas  and  in  breast 
tumors  that  arise  at  high  frequency  in  MMTV-PyV  LT 
transgenic  mice  (Webster  et  al.,  1998),  We  thus 
postulated  that  genetic  alterations  within  the  7q22 
region  may  occur  not  only  in  human  uterine  leiomyo¬ 
mas  but  also  in  human  breast  tumors. 

As  a  first  step  to  verify  whether  LOH  of  7q22  may 
occur  in  breast  cancer,  we  analysed  breast  tumor  DNA 
samples  that  had  previously  been  characterized 
regarding  LOH  of  7q31,  the  chromosomal  region 
adjacent  to  7q22.  We  and  others  have  shown  that 
within  7q22  the  marker  D7S518  is  the  most  frequently 
deleted  in  uterine  leiomyomas.  Thus,  we  first  asked 
whether  breast  tumors  with  or  without  LOH  of  7q31 
could  exhibit  LOH  of  D7S518.  This  marker  has  been 
mapped  to  intron  20  of  the  CUTLl  gene  (Zeng  et  al., 
1997;  and  Zeng  et  al.,  manuscript  in  preparation).  A 
total  of  33  pairs  of  samples  from  Bieche  et  al.  (1997) 
study  were  analysed.  LOH  of  D7S518  was  found  in  five 


of  ten  cases  with  LOH  of  7q31  and  in  one  out  of  23 
cases  without  LOH  of  7q31  (Table  1).  A  total  of  63 
pairs  of  samples  from  the  Lin  et  al.  (1996)  study  were 
analysed.  Among  nine  pairs  of  samples  with  LOH  of 
7q31,  seven  were  informative  for  D7S518  and  four 
exhibited  LOH  of  this  marker  (Table  1).  Among  54 
pairs  of  samples  without  LOH  of  7q31,  two  had 
suffered  LOH  of  D7S518.  Results  obtained  with  these 
two  sets  of  samples  demonstrated  that  among  breast 
tumors  with  LOH  of  the  7q31  region,  53%  (9/17)  also 
exhibit  LOH  of  the  D7S518  marker  in  7q22.  It  is  likely 
that  the  deleted  region  in  these  tumors  is  large  and 
encompasses  both  7q22  and  7q31.  On  the  other  hand, 
among  breast  tumors  without  LOH  of  7q31,  a  small 
fraction  suffered  LOH  of  D7SD518.  Thus,  LOH  of 
D7S518  can  be  observed  independently  of  LOH  at 
7q3L  We  conclude  that  the  long  arm  of  chromosome  7 
contains  at  least  two  tumor  suppressor  genes  that  can 
be  inactivated  in  breast  cancers,  one  at  7q22  and  one  at 
7q3L 

LOH  of  7q22  and  7q31  in  tumors  from  the 
NCIC-Manitoba  breast  tumor  bank 

To  establish  the  incidence  and  extent  of  7q22  deletions, 
we  investigated  LOH  of  7q22  in  samples  from  the 
National  Cancer  Institute  of  Canada  (NCIC)-Manito- 
ba  Breast  Tumor  Bank  (Watson  et  al.,  1996).  We  have 
analysed  66  pairs  of  tumors  and  adjacent  normal 
control  tissues  representative  of  all  classes  of  breast 
tumors.  At  least  three  microsatellite  markers  within 
7q22,  D7S518,  D7S515  and  D7S666,  were  used  in 
all  cases.  In  a  previous  study,  we  have  mapped  the 
marker  D7S518  within  an  intron  of  the  CUTLl  gene 
and  the  markers  D7S515  and  D7S666  close  to  the  5' 
end  of  this  gene  (Zeng  et  at.,  1997).  Determination  of 
the  CUTLl  exon/intron  structure  and  DNA  sequen¬ 
cing  of  the  long  arm  of  chromosome  7  has  since 
revealed  that  all  three  markers  are  located  within 
the  CUTLl  gene,  in  introns  20,  3  and  6  respectively 
(Figure  2)  (Zeng  el  al.,  manuscript  in  prepara¬ 
tion;  http://www.ncbi.  nlm.nih.gov,  accession  number 
AF024533  and  AF047825).  Tumors  with  LOH  of  one 
or  more  of  these  markers  were  then  further  analysed  to 
map  the  boundaries  of  the  deleted  regions.  The 
physical  order  of  all  of  the  microsatellite  markers  and 
CUTl  with  respect  to  each  other  have  previously  been 
established  (Zeng  et  al.,  1997).  The  number  of  cases 
studied  for  each  marker,  the  number  of  informative 


Table  1  Loss  of  heterozygosity  of  D7S518  in  breast  tumors  with  or 
without  LOH  of  7q31 


Sources  of 
samples 

7q31  status  in 
breast  cancers 

Samples 

analysed 

Informative 
cases  for 
D7S518 

LOH  of 
D7SSI8 

Bieche  et  al. 

LOH  of  7q31 

10 

10 

5  (50%) 

Study 

Retention  of  7q31 

23 

23 

1  (4%) 

Lin  et  al 

LOH  of  7q31 

9 

7 

4  (57%) 

Study 

Retention  of  7q31 

54 

54 

2  (3.7%) 

DNAs  from  breast  cancers  and  matched  normal  tissues  were 
amplilied  from  patients  with  breast  cancers  using  oligonucleotide 
primers  for  the  polymorphic  marker  D7S518.  The  source  of  samples 
is  indicated,  together  with  the  numbers  of  patients  tested,  informative 
patients  and  patients  with  LOH  for  markers  on  7q31  or  7q22.  The 
information  regarding  LOH  at  7q31  was  derived  from  the  original 
studies 
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cases,  and  the  number  and  percentage  of  cases  that 
exhibit  LOH  are  given  in  Table  2  and  shown  in  Figure 
2.  Representative  LOH  results  are  shown  in  Figure  1. 
LOH  was  scored  with  at  least  one  7q22  marker  in  12  of 
66  (18.2%)  of  the  tumors  examined  (patients  10544, 
11305,  93-4635,  93-5199,  93-11232,  93-11747,  93-12017, 
93-18747,  94-1663,  94-3808,  94-5629  and  94-133582,  see 
Figure  2).  The  superposition  of  the  overlapping 
deletions  in  the  12  tumors  revealed  two  common 
regions  of  deletion,  one  in  7q22  and  encompassing  the 
CUTLl  gene,  and  one  in  7q31  (Figure  2).  The 
proximal  (centromeric)  and  distal  (telomeric)  bound¬ 
aries  of  the  critical  region  in  7q22  were  defined  by 
breakpoints  in  tumors  #10544,  11305,  and  #93-5199 
that  were  flanked  by  the  D7S666  and  D7S658  markers, 
respectively  (Figure  2).  The  proximal  (centromeric)  and 
distal  (telomeric)  boundaries  of  the  critical  region  in 
7q31  were  flanked  by  the  D7S480  and  D7S650 
markers,  respectively  (Figure  2). 

Association  between  LOH  at  7q22  and  clinical 
parameters 

To  determine  whether  loss  of  genetic  material  at  7q22 
or  7q31  may  be  associated  with  pathological  features 
of  breast  tumors,  we  analysed  available  clinical  data 
including  the  tumor  type,  grade  and  size,  estrogen  and 
progesterone  receptor  (ET  and  PR)  status,  occurrence 
of  lymph  node  metastasis  and  age  of  onset.  Statistical 
analysis  was  performed  using  the  Mann -Whitney  test 
(Mann  and  Whitney,  1947).  We  found  no  correlation 
between  LOH  at  7q22  and  tumor  type  or  grade,  ER  or 
PR  expression,  nodal  status,  or  age  of  onset.  However, 
there  was  a  significant  association  {P  value  =  0.0082) 
between  LOH  at  7q22  and  tumor  size  (Table  3).  The 
average  sizes  of  breast  tumors  with  or  without  LOH  at 
7q22  were  4.99  and  2.93  cm  respectively.  The  difference 
in  size  was  further  accentuated  in  the  group  of  tumors 
with  LOH  at  7q22  but  retention  of  7q31  (6.5  Vs 
2.99  cm;  P  value  =  0.0074).  In  contrast,  no  significant 
correlation  was  found  between  LOH  at  7q31  and  any 
clinical  feature. 


Table  2  LOH  analysis  of  66  breast  cancers  using  11  polymorphic 
markers  on  chromosome  7 


Markers 

Patients  tested 

Informative 

patients 

Patients 
with  LOH 

D7S524 

12 

5 

2 

D7S527 

3 

2 

0 

D7S518 

66 

59 

6 

D7S666 

66 

47 

4 

D7S515 

66 

48 

8 

D7S658 

5 

3 

0 

D7S471 

1 

1 

0 

D7S486 

9 

8 

3 

D7S522 

66 

45 

3 

D7S480 

66 

61 

7 

D7S650 

66 

48 

7 

DNAs  from  tumors  and  matched  normal  peripheral  breast  tissues 
were  amplified  from  66  patients  with  breast  cancers  using 
oligonucleotide  primers  for  11  polymorphic  markers  on  chromsomc 
7q.  The  list  of  markers  is  presented,  together  with  the  numbers  of 
patients  tested,  informative  patients  and  patients  with  LOH  for  each 
marker.  The  level  of  informativeness  observed  for  each  of  the 
markers  in  our  cohort  of  patients  was  consistent  with  the  published 
values 


LOH  within  the  7q31  region  has  previously  reported  in 
human  breast  carcinomas,  and  in  several  cases  the 
deletions  encompassed  the  adjacent  region,  7q22 
(Bieche  et  al.,  1992,  1997;  Champeme  et  al.,  1995; 
Deng  et  al,  1994;  Lin  et  al.,  1996;  Zenklusen  et  al., 
1994).  The  results  of  the  present  study  establish  that  in 
some  breast  tumors  the  loss  of  genetic  material  on  the 
long  arm  of  chromosome  7  can  be  limited  to  the  7q22 
region.  Thus,  regarding  chromosomal  deletions  on  7q, 
collectively  the  available  data  define  three  classes  of 
breast  tumors  on  the  basis  of  whether  the  LOH  region 
encompasses  markers  in  only  7q22,  only  7q31  or  both 
7q22  and  7q31  (Bieche  et  al.,  1992,  1997;  Champeme  et 
al.,  1995;  Deng  et  al.,  1994;  Lin  et  al.,  1996;  Zenklusen 
et  al.,  1994).  These  results  suggest  that  7q  contains  at 
least  two  tumor  suppressor  genes  that  can  be  inactivated 
in  breast  cancers,  one  at  7q22  and  one  at  7q31. 

Three  sets  of  tumor  samples  originating  from 
different  research  centers  have  been  analysed  in  this 
study.  LOH  of  7q22  and  7q31  have  been  found  in 
tumors  from  each  set,  however,  there  was  considerable 
variation  in  the  proportion  of  LOH  between  the  three 
sets  especially  for  markers  at  7q31.  This  variation  is 
unlikely  to  be  due  to  the  differences  in  ethnic  group 
composition  of  each  set  since  the  patients  in  all  sets 
were  almost  exclusively  of  Caucasian  origin.  We  think 
that  differences  in  LOH  frequencies  can  be  attributed 
mainly  to  the  faet  that  we  have  been  very  conservative 
in  our  appreciation  of  LOH.  In  particular,  we  did  not 
count  as  LOH  any  sample  where  both  alleles  in  the 
normal  control  were  not  clear.  This  bias  almost 
certainly  led  us  to  underestimate  LOH  frequencies. 

There  was  no  correlation  between  LOH  at  7q22  and 
tumor  grade,  suggesting  that  the  loss  of  genetic 
material  at  7q22  is  probably  an  early  event  in  tumor 
development  and,  most  likely,  is  not  associated  with 
tumor  progression.  On  the  other  hand,  LOH  of  7q22 
was  associated  with  increased  tumor  size,  raising  the 
possibility  that  the  tumor  suppressor  gene  at  7q22 
could  function  to  restrict  cellular  proliferation. 

The  smallest  commonly  deleted  region  on  7q22 
includes  polymorphic  markers  that  are  all  located 
within  the  CUTLl  gene.  Therefore,  the  tumor 
suppressor  gene  at  7q22  either  is  CUTLl  or  is  located 
close  to  it.  Interestingly,  in  breast  tumors  that  arise  in 
MMTV-PyV  LT  transgenic  mice,  the  murine  Cut 
protein  was  found  to  form  a  complex  together  with 
the  PyV  LT  antigen  (Webster  et  al.,  1998).  Moreover, 
in  cotransfection  studies,  the  CUTLl  gene  product  was 
also  coimmunoprecipitated  together  with  the  SV40 
Large  T  antigen  (SV40  LT)  (Martin  et  al.,  manuscript 
in  preparation).  At  present  the  effect  of  Large  T 
oncoproteins  on  Cut  function  is  not  clear.  However, 
these  viral  oncoproteins  have  previously  been  found  to 
inactivate  the  function  of  the  p53  and  pRB  tumor 
suppressor  proteins.  Thus,  interactions  with  these  viral 
oncoproteins  suggest  that  Cut  proteins  may  play  an 
important  role  in  the  control  of  cellular  proliferation. 
This  hypothesis  received  further  support  from  the 
finding  that  Cut  DNA  binding  activity  is  regulated  in  a 
cell-cycle  dependent  manner  (Coqueret  et  al.,  1998). 
Whereas  Cut  was  expressed  in  all  phases  of  the  cell 
cycle.  Cut  DNA  binding  activity  was  the  highest  at  the 
end  of  G1  and  during  S  phase. 
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In  conclusion,  two  sets  of  data,  LOH  mapping 
analysis  and  protein-protein  interaction  studies, 
strongly  point  towards  the  CUTLl  gene  as  a 
candidate  tumor  suppressor  gene.  It  should  be 
stressed,  however,  that  the  CUTLl  gene  appears  to 


cover  a  very  large  distance,  over  200  Kbp,  and  that 
several  introns  are  more  than  10  Kbp  (Zeng  et  al., 
manuscript  in  preparation).  It  is  thus  possible  that 
another  transcription  unit  exists  within  the  boundaries 
of  the  CUTLl  gene.  In  accordance  with  the  recently 
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Figure  1  Representative  PCR  amplifications  of  (CA)n  microsatellite  repeats.  DNAs  from  tumors  (T)  and  matched  normal  breast 
tissues  (N)  were  analysed  from  66  patients  with  breast  cancer.  Representative  PCR  amplifications  of  (CA)n  microsatellite  repeats 
D7S518,  D7S666,  D7S524,  D7S515,  D7S522,  D7S480,  D7S650,  D7S658  are  shown.  Oligonucleotide  primers  were  used  to  PCR 
amplify  the  regions  of  DNA  containing  these  markers,  in  the  presence  of  radiolabeled  dCTP.  Products  were  denatured  and 
separated  on  a  standard  5%  sequencing  gel.  A  patient  is  considered  to  be  informative  if  there  are  two  major  bands  (corresponding 
to  two  alleles)  in  the  normal  DNA  lane.  A  patient  shows  LOH  if,  in  the  tumor  DNA  lane,  one  of  the  alleles  is  absent  or  shows 
diminished  intensity.  For  example,  patient  10544  is  informative  for  the  marker  D7S515,  and  shows  LOH  at  that  locus,  but  shows  no 
LOH  at  locus  D7S658  and  patient  11305  is  uninformative  for  marker  D7S658 
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proposed  rules  for  the  definition  of  a  tumor  suppressor 
gene,  a  firm  statement  about  the  identity  of  the  tumor 
suppressor  gene  at  7q22  will  have  to  await  the 
demonstration  either  that  the  remaining  allele  is 
mutated  or  that  the  function  of  the  CUTLl  gene 
product  is  inactivated  or  altered  in  some  breast  tumors 
(Haber  and  Harlow,  1997).  At  the  minimum,  the  fact 
that  a  polymorphic  marker  within  CUTLl  is  consis¬ 
tently  deleted  in  breast  tumors  with  LOH  of  7q22  will 
provide  a  useful  start  point  for  positional  cloning 
approaches  to  identify  the  critical  tumor  suppressor 
gene  at  7q22. 


Sixty-six  pairs  of  samples  from  NCIC-Manitoba  Breast 
Tumor  Bank  have  been  analysed  in  this  study.  All  tumor 
samples  selected  from  the  Manitoba  Breast  Tumor  Bank 
were  high  quality  tissues  selected  to  ensure  >30% 
epithelial  tumor  component  with  minimal  contaminating 
normal  tissues,  and  matching  normal  tissues  were  also 
histologically  verified.  All  tissue  histological  assessment 
was  performed  uniformly  by  a  single  pathologist  (PW)  and 
conducted  on  high  quality  paraffin  sections  from  the  face 
of  the  tissue  block,  thus  ensuring  consistency  in  assessing 
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Figure  2  Mapping  of  LOH  within  7q.  (Left)  Representation  of  human  chromosome  7  including  band  assignments.  (Right)  Names 
of  polymorphic  markers  used  are  given,  along  with  chromosome  band  assignments  and  the  position  of  the  CUTLl  gene.  Twelve 
columns  representing  12  patients  with  a  loss  of  heterozygosity  on  chromosome  7q22  arc  shown.  Explanation  of  the  symbols  used  is 
given.  The  smallest  common  deletion  in  these  seven  patients  is  indicated  by  the  vertical  line  to  the  right.  All  54  tumors  not  shown  in 
this  figure  were  informative  for  at  least  one  marker  within  the  critical  region  but  failed  to  show  LOH 
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Table  3  Relationship  between  allelic  loss  for  loci  at  7q22  and  7q31  and  clinicopathological  characteristics  of  the  66  breast  tumors 


Clinical-pathological  characteristics 

+ 

LOH  7q22 

Allelic  loss  for  loci  at  7q 

P  value  + 

LOH  7q31 

P  value 

Type 

Ductal 

10 

46 

9 

47 

Lobular 

1 

4 

ns 

1 

4 

ns 

Other 

sa 

CO,  pa,  is,  is 

sa 

CO,  pa,  is,  is 

Grade 

well 

0 

10 

0 

10 

mod 

6 

22 

7 

21 

poor 

5 

20 

ns 

3 

22 

ns 

not  graded 

1 

2 

1 

2 

ER 

positive 

7 

36 

ns 

8 

36 

ns 

negative 

4 

17 

3 

18 

PR 

positive 

5 

24 

4 

23 

negative 

6 

29 

ns 

7 

31 

ns 

not  assayed 

1 

4 

1 

1 

LN  status 

positive 

4 

17 

2 

17 

negative 

8 

32 

ns 

9 

34 

ns 

unknown 

- 

4 

- 

4 

Size 

mean  (ems) 

4.99 

2.93 

0.0082 

2.77 

3.45 

ns 

Age 

mean  (yrs) 

53 

59 

ns 

55 

59 

ns 

The  statistical  analysis  was  performed 

using  the 

Mann-Whitney  test. 

Grade  ==  Nottingham  grade, 

Sa  =  sarcoma,  co  = 

colloid  carcinoma, 

pa  =  papillary 

carcinoma,  is  =  predominant  ductal  carcinoma  in  situ  with  small  ductal 

invasive  foci,  ns  = 

not  significant,  ER 

=  estrogen  receptor, 

PR  —  progesterone  receptor,  LN  =  lymph  node 


histological  features  (Watson  et  al.,  1996).  Samples  from 
the  Lin  et  al.  (1996)  study  were  obtained  from  two  different 
sources:  43  from  the  Surgical  Pathology  Laboratory  of  the 
Royal  Victoria  Hospital,  Montreal,  Canada  and  20  from 
the  Mount  Sinai  Hospital,  Toronto,  Canada.  Samples  from 
the  Bieche  study  were  obtained  from  the  Centre  Rene 
Huguenin,  Saint-Cloud,  France  (Bieche  et  al.,  1997). 

Genomic  DNA  extraction  from  paraffin  blocks 

The  basic  protocol  has  been  reported  before  (Watson  et  al., 
1993).  Twenty  pm  sections  were  cut  and  placed  in  1.5  ml 
Eppendorf  tubes.  This  material  was  then  subjected  to  two 
xylene  washes  and  two  ethanol  washes  (100  and  95%), 
incubated  for  1-3  days  in  lysis  buffer  (800  /rg/ml 
proteinase  K,  0.1  M  Tris  (pH  8)  and  0.004  M  EDTA), 
followed  by  heat  inactivation  (boiling)  and  ethanol 
precipitation.  The  quantity  or  quality  of  DNA  of  66  pairs 
of  samples  were  good  enough  for  the  following  micro¬ 
satellite  repeat  analysis  of  all  markers. 

Microsatellite  repeat  analysis 

DNA  from  tumors  and  matched  control  tissues  were 
characterized  by  polymerase  chain  reaction  (PCR)  analysis 
using  a  pair  of  oligonucleotide  primers  for  each  poly¬ 
morphic  marker.  Ten  (CA)„  repeat  microsatellite  markers 
on  chromosome  7  were  used  to  identify  the  region  of  loss 
of  chromosome  7  in  breast  cancers:  D7S524,  D7S527, 
D7S515,  D7S666,  D7S518,  D7S471,  D7S486,  D7S522, 
D7S480,  D7S650.  Information  of  the  primer  sequences 
and  allele  lengths  are  available  in  the  Genome  Database 
(GDB).  Some  of  these  markers,  D7S527,  D7S658  and 
D7S471,  have  been  used  only  with  a  subset  of  samples  in 
order  to  define  more  precisely  the  deleted  region.  To 
enhance  the  sensitivity  of  the  assay  and  to  limit  the  number 
of  amplification  cycles,  radioactively  labeled  deoxyribonu- 
cleotides  have  been  included  in  the  reaction.  The  PCR 
amplification  conditions  were  similar  to  that  reported 
before  (Zeng  et  al.,  1997).  The  (CA)„  repeats  were 
amplified  by  PCR  in  a  final  volume  of  50  pi,  containing 
50  ng  DNA,  1.5  mM  MgCU,  5  p\  standard  10  x  PCR  buffer 
(200  mM  Tris-HCl  pH  8.4,  500  mM  KCl),  0.4  pM  of  each 
primer,  0.125  mM  dNTPs,  1  pCi  [a-“P]dCTP  and  1  unit  of 
Taq  polymerase  (Gibco/BRL,  Burlington,  Ontario).  An 
initial  step  of  5  min  at  95°C  was  followed  by  35  cycles  of 


1  min  denaturation  at  94'’C  30  s  of  annealing  at  59°C  and 
1.5  min  of  extension  at  72°C,  followed  by  a  final  extension 
step  of  7  min  at  12°C.  Pilot  experiments  with  different  PCR 
cycle  numbers  indicated  that  35  cycles  was  still  within  the 
linear  amplification  range.  PCR  products  were  separated  in 
a  6%  sequencing  gel  containing  formamide  or  a  conven¬ 
tional  6%  sequencing  gel. 

Determination  of  LOH 

Allelic  loss  was  scored  only  on  informative  patients  whose 
normal  DNA  samples  were  polymorphic  at  a  given  locus. 
Patients  who  were  uninformative  were  not  considered. 
LOH  was  identified,  visually  or  following  Phosphoimager 
densitometric  analysis,  as  a  loss  in  intensity  (>50%)  or 
complete  loss  of  one  allele  in  the  tumor  DNA  when 
compared  with  the  normal  DNA  from  the  same  patient. 
All  cases  of  LOH  were  confirmed  by  three  separate 
experiments  with  two  different  reviewers.  It  is  worth 
noting  that  we  have  been  very  conservative  in  our 
appreciation  of  LOH.  In  particular,  we  did  not  count  as 
LOH  any  sample  where  both  alleles  in  the  normal  control 
were  not  clear.  This  bias  possibly  led  us  to  underestimate 
LOH  frequencies.  In  certain  cases,  additional  markers  have 
been  used  in  order  to  define  more  precisely  the  deleted 
region. 
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Abstract 

The  human  CUTLl  gene  (Cut-like  1 )  is  a  candidate  tumor  suppressor  gene  loeated  on  ehromosome  7  at  band  22,  a  region 
that  is  frequently  deleted  in  several  human  cancers.  The  gene  spans  at  least  340  kb  and  contains  33  exons.  Synthesis  of  five 
different  transcripts  involves  two  promoter  regions,  two  polyadenylation  sites  and  seven  alternative  splicing  events.  The  two 
polyadenylation  sites  are  located  at  the  ends  of  exons  24  and  33  and  are  separated  by  approximately  40  kb.  Transcription  is 
initiated  in  two  genomic  regions,  giving  rise  to  alternate  first  exons  which  are  spliced  to  a  common  exon  2.  All  transcripts  contain 
exons  2  to  14,  but  differ  in  their  3'  regions.  Exon  14  can  be  spliced  alternatively  to  the  beginning  or  the  middle  of  exon  15,  or  to 
exon  25,  generating  transcripts  with  exons  15  to  24  or  exons  25  to  33.  Moreover,  exon  16  can  be  spliced  out  from  the  mature 
transcripts  that  contain  exons  15  to  24.  Overall,  five  distinct  transcripts  are  generated  as  a  result  of  alternative  transcription 
initiation,  splicing  and  polyadenylation.  We  discuss  potential  mechanisms  by  which  alternate  polyadenylation  site  usage  may  affect 
alternative  splicing  events  and  vice  versa.  ©  2000  Elsevier  Science  B.V.  All  rights  reserved. 

Keywords.'  Alternative  polyadenylation  and  splicing;  CCAAT  displacement  protein;  Human  Cut  homeobox  gene 


1.  Introduction 

The  cDNAs  encoding  for  homologues  of  the 
Drosophila  Cut  homeodomain  protein  have  recently 
been  isolated  from  several  mammalian  species,  including 
human,  dog,  mouse  and  rat,  and  were  respectively 
termed  CDP/Cut  (CCAAT  displacement  protein/human 
Cut).  Clox  (Cut-like  homeobox).  Cux  (Cut  homeobojO 
and  CDP-2  (Andres  et  al.,  1992;  Neufeld  et  al.,  1992; 
Valarche  et  al.,  1993;  Dufort  and  Nepveu,  1994;  Yoon 
and  Chikaraishi,  1994).  More  recently,  a  second  Cux 
gene,  called  Cux-2,  was  identified  in  mouse.  In  contrast 
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to  Cux-1,  which  is  expressed  in  most  tissues,  Cux-2  was 
found  to  be  expressed  exclusively  in  nervous  tissues 
(Quaggin  et  al.,  1996).  To  simplify  the  terminology,  the 
term  ‘Cut’  will  be  used  in  this  paper  to  designate 
the  protein. 

Sequence  homology  between  Drosophila  melanogaster 
and  mammalian  Cut  proteins  is  limited  to  five  evolution- 
arily  conserved  domains:  a  region  predicted  to  form  a 
coiled-coil  structure  (CC),  three  regions  of  ~70  amino 
acids,  the  Cut  repeats  (CRl,  CR2  and  CR3)  and  a  Cut- 
type  homeodomain  (HD)  (Blochlinger  et  al.,  1988).  The 
Cut  repeats  were  shown  to  function  as  specific  DNA- 
binding  domains  and,  at  least  for  Cut  repeat  3,  to 
cooperate  with  the  Cut  homeodomain  to  bind  to  DNA 
(CR3HD)  (Andres  et  al.,  1994;  Aufiero  et  al.,  1994; 
Harada  et  al.,  1994).  In  addition,  two  active  repression 
domains  have  been  identified  in  the  carboxy-terminal 
domain  of  the  protein  (Mailly  et  al.,  1996).  In  several 
vertebrate  species,  the  gene  was  found  to  encode  for  a 
second  protein,  called  CASP  (cut  alternatively  spliced 
product),  which  contains  the  coiled-coil  region  but  none 
of  the  DNA-binding  domains  (Lievens  et  al.,  1997). 

Genetic  studies  in  Drosophila  melanogaster  indicated 
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that  cut  plays  an  important  role  in  the  determination 
and  maintenance  of  cell-type  specificity  in  several  tissues. 
Mutations  in  cut  cause  defects  in  several  tissues  including 
the  wings,  legs,  external  sense  organs,  Malpighian 
tubules,  tracheal  system  and  some  structures  in  the 
central  nervous  systems  (Liu  et  al.,  1991).  In  most 
instances,  defects  appear  to  result  from  the  fact  that 
some  cells  have  enrolled  in  the  wrong  developmental 
program  (Blochlinger  et  al.,  1991;  Liu  et  al.,  1991). 
Ectopic  expression  of  Cut  proteins  from  either 
Drosophila  melanogaster,  man  or  mouse,  was  shown  to 
have  a  similar  effect  on  embryonic  sensory  organ  devel¬ 
opment  in  Drosophila  melanogaster  (Ludlow  et  al., 
1996).  Moreover,  the  ‘cut  wing’  phenotype  was  rescued 
by  either  proteins  (Ludlow  et  al.,  1996).  These  findings 
suggest  that  mammalian  Cut  proteins  also  play  an 
important  role  in  determining  cell-type  specificity. 

In  vertebrates.  Cut  proteins  were  originally  identified 
as  CCAAT  displacement  proteins  (CDP)  (Barberis  et  al., 
1987;  Skalnik  et  al.,  1991;  Neufeld  et  al.,  1992). 
CDP/Cut  DNA-binding  activity  was  found  to  disappear 
upon  differentiation  of  myeloid  cells  (Skalnik  et  al., 
1991;  Lievens  et  al.,  1995),  and  in  the  kidney,  expression 
of  the  mouse  CDP/Cut  protein,  Cux-1,  was  found  to  be 
inversely  related  to  the  degree  of  cellular  differentiation 
(Heuvel  et  al.,  1996).  It  was  thus  proposed  that  mamma¬ 
lian  CDP/Cut  proteins  function  as  transcriptional 
repressors  that  inhibit  terminally  differentiated  gene 
expression  during  early  stages  of  differentiation  (Skalnik 
et  al.,  1991;  Lievens  et  al.,  1995;  Heuvel  et  al.,  1996). 
More  recently.  Cut  proteins  were  found  to  play  a  role 
in  cell  cycle  progression.  Cut  DNA-binding  activity  was 
minimal  in  GO  and  early  Gl,  but  raised  at  the  end  of 
G1  and  during  S  phase  as  a  result  of  increased  expression 
as  well  as  dephosphorylation  by  the  Cdc25A  phospha¬ 
tase.  Increased  Cut  activity  was  associated  with  the 
transcriptional  repression  of  p21'''afi/cipi/sdi  (Coqueret 
et  al.,  1998).  Moreover,  CDP/Cut  was  found  to  be  a 
component  of  the  promoter  complex  HiNF-D,  which  is 
believed  to  participate  in  the  induction  of  transcription 
of  several  histone  genes  at  the  Gl/S  phase  transition  of 
the  cell  cycle  (Vanwijnen  et  al,  1996).  The  molecular 
basis  for  this  dual  role  of  CDP/Cut  as  a  transcriptional 
repressor  or  activator  remains  to  be  determined.  It  is 
possible  that  alternative  gene  products  are  expressed  in 
different  cell  types  or  at  different  times,  as  suggested 
form  Northern  blot  analysis  and  cDNA  sequencing 
(Neufeld  et  al,  1992;  Valarche  et  al.,  1993;  Lievens 
et  al.,  1997).  Another  interesting  possibility  is  that 
CDP/Cut  may  alternatively  interact  with  either  pRb  or 
pl07  on  different  promoters  (Vanwijnen  et  al.,  1996). 

Accumulated  evidence  indicates  that  CUTLl  is  a 
candidate  tumor  suppressor  gene.  The  human  CDP/Cut 
protein  was  shown  to  repress  c-Myc,  a  proto-oncogene 
that  is  frequently  activated  in  several  cancers  (Dufort 
and  Nepveu,  1994).  Cytogenetic  studies  revealed  that 


the  7q22  chromosomal  region  is  frequently  deleted  in 
some  human  cancers,  including  leiomyomas,  acute 
myeloid  leukemia  (AML),  and  myelodysplastic  syn¬ 
drome  (MDS).  The  loss  of  heterozygosity  (LOH)  at 
7q22  has  been  reported  in  a  subset  of  uterine  leiomyomas 
and  breast  tumors  (Ishwad  et  al.,  1997;  Zeng  et  al., 
1997,  1999).  In  every  case,  the  smallest  deleted  region 
included  one  or  more  of  three  polymorphic  markers 
that  are  located  within  or  close  to  the  CUTLl  gene. 
Moreover,  CUTLl  mRNA  levels  were  reduced  in  eight 
tumors  out  of  13  (Zeng  et  al.,  1999).  Interestingly,  in 
both  leiomyomas  and  mammary  tumors  induced  in 
transgenic  mice  expressing  the  Polyomavirus  (PyV) 
Large  T  (LT)  antigen,  immunocomplexes  of  CUTLl 
and  PyV  LT  antigen  were  detected  (Webster  et  al., 
1998).  The  existence  of  such  complexes  suggested  that 
alterations  in  the  function  of  the  Cut  protein  may  be  an 
important  event  in  the  etiology  of  some  cancers. 

To  obtain  a  complete  list  of  transcript  variants  and 
to  facilitate  a  search  for  mutations  within  CUTLl  in 
human  cancer,  we  have  defined  the  exon-intron  struc¬ 
ture  of  the  human  CUTLl  gene  and  characterized  its 
multiple  transcripts.  This  study  has  revealed  a  very 
complex  gene  organization  that  allows  the  production 
of  five  alternative  CUTLl  transcripts.  These  findings 
raise  interesting  questions  regarding  the  regulation  of 
alternative  splicing  and  polyadenylation. 

2.  Materials  and  methods 

2. 1.  Sequences  of  CUTLl  cDNAs  and  genomic  DMA 
clones 

The  cDNA  sequences  of  HS  CDP/Cut,  HS  CASP 
and  MM  CASP  can  be  found  at  GenBank,  Accession 
Nos.  M74099,  LI  2579,  U68542  and  U66249.  The 
Accession  Nos.  for  the  CUTLl  genomic  DNA  clones 
are  as  follows:  46f6  (AF006752),  Pac  50h2  (AF047825); 
cosmid  186D2  (AF024534),  cosmid  123E15  (AF024533) 
and  Pac  76h2  (AF030453).  Sequence  data  for  contigs 
on  7q22  can  be  obtained  at  the  Washington  University 
Human  Genome  Sequencing  Project’s  World  Wide  Web 
site  (http://genome.wustl.edu/gsc/gschmpg.html).  The 
appropriate  contigs  are:  7/H  RG339C12  (AC005096). 

2.2.  Isolation  of  genomic  clones 

A  human  genomic  placental  DNA  library  cloned  in 
the  bacteriophage  vector  7.DASH  1  (Stratagene)  was 
screened  with  a  CDP/Cut  cDNA  probe  containing  nt 
18  to  4881.  Positive  clones  were  selected,  purified  and 
amplified  in  LE392  bacteria  using  standard  procedures. 
The  Lawrence  Livermore  National  Laboratory  chromo¬ 
some  7-specific  cosmid  library  (obtained  from  Dr.  S.W. 
Scherer)  was  screened  with  the  CDP/Cut  cDNA  probe 
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to  identify  overlapping  cosmid  clones  containing  geno¬ 
mic  DNA  sequences  not  isolated  from  the  phage  library. 
DNA  from  each  phage  or  cosmid  clone  was  isolated 
and  used  in  cycle  sequencing  reactions  to  determine  the 
intron-exon  boundaries  of  CDP/Cut. 

2.3.  Plasmid  construction 

The  plasmid  containing  genomic  DNA  on  either  side 
of  exon  IB,  pCR2.1-ll,  was  obtained  in  the  following 
manner.  A  fragment  of  637  bp  was  generated  by  PCR 
amplification  of  human  genomic  DNA  using  as  primers 
the  oligonucleotides  GGTGCGGCGCGGGAGGA  and 
GAGCATTCTAAGCCGCGGGA  and  was  subcloned 
into  the  pCR2.1  vector  (InVitrogen).  The  plasmid  con¬ 
taining  genomic  DNA  on  either  side  of  exon  2,  Clone 
57,  was  obtained  by  subcloning  a  1.4  kb  BamHl  frag¬ 
ment  from  pac68m21  into  the  corresponding  site  of  the 
vector  bluescribe  pKS  (Stratagene). 

2. 4.  Cycle  sequencing  of  phage  and  cosmid  DNA 

The  appropriate  purified  genomic  DNA  from  phage 
or  cosmid  clones  (500  ng)  was  used  in  cycle  sequencing 
reactions  with  CDP/Cut-specific  primers  to  determine 
the  intron-exon  junctions  and  flanking  intron  sequences. 
These  reactions  were  carried  out  using  the  GibcoBRL 
dsDNA  Cycle  Sequencing  System  (GibcoBRL, 
Burlington,  Ontario)  according  to  the  instructions 
included  in  the  kit.  The  sequences  of  the  primers  used 
for  this  purpose  are  available  from  the  authors. 

2.5.  PCR  to  estimate  the  sizes  of  introns 

To  estimate  the  sizes  of  the  introns  of  the  CUTLl 
gene,  appropriate  CDP/Cut-specific  oligonucleotide 
primers  were  generated  and  used  in  PCR  reactions  to 
amplify  each  individual  intron.  PCR  for  introns  up  to 
2  kb  in  size  was  carried  out  in  a  final  volume  of  50  pi, 
containing  100  ng  of  genomic  DNA  prepared  from 
human  placenta  or  the  human  cell  lines  HeLa  and  293, 
1.5  mM  MgCl2,  5  pi  of  standard  10  x  PCR  buffer 
(200  mM  Tris-HCl,  pH  8.4,  500  mM  KCl),  50  pmol  of 
each  primer,  1.25  mM  dNTPs,  and  2.5  units  of  Taq 
polymerase  (GibcoBRL,  Burlington,  Ontario).  An  initial 
step  of  3  min  at  85°C  was  followed  by  35  cycles  of  40  s 
of  denaturation  at  94°C,  30  s  of  annealing  at  55°C,  and 
40  s  of  extension  at  72°C,  followed  by  a  final  extension 
step  of  2  min  at  72°C.  Introns  between  2  and  6  kb  were 
amplified  in  conditions  as  described  above,  with  the 
addition  of  2.5  units  of  Taq  extender  PCR  additive 
(Stratagene)  and  substituting  the  Taq  extender  buffer 
for  the  regular  PCR  buffer.  Introns  larger  than  6  kb 
were  amplified  using  the  Expand  Long  Template  PCR 
System  (Boehringer  Mannheim)  according  to  the 
instructions  included  in  the  kit.  PCR  products  were 


analyzed  by  electrophoresis  of  1/1 0th  of  the  reaction  on 
a  1%  agarose  gel.  The  size  of  each  intron  was  estimated 
by  comparison  of  the  PCR  product  with  molecular  size 
markers.  Each  intron  was  amplified  at  least  twice  from 
two  different  cell  lines  (HeLa  and  293).  The  sizes  of 
introns  5,  6,  7,  10,  12-19  (incl.),  21,  22  and  23  were 
estimated  in  this  manner.  Intron  sizes  obtained  experi¬ 
mentally  were  also  confirmed  using  genomic  DNA 
sequence  information  (see  below).  The  sizes  of  introns 
1,  2,  3,  4,  8,  9,  11  and  20  were  obtained  from  analysis 
of  genomic  DNA  sequences  of  the  appropriate  portion 
of  chromosome  7  (see  Fig.  1  A),  available  at  the  National 
Center  for  Biotechnology  Information  (NCBI)  using 
BLAST  search  tools  (http://www.ncbi.nlm.nih.gov/ 
cgi-bin/BLAST/nph-blast?Jform  =  0).  The  appropriate 
clones  are:  Pac50h2,  cosmid  186D2,  cosmid  123E15  and 
PAC  76h2. 

2.6.  RNA  isolation 

Total  RNA  was  isolated  from  various  human  cell 
lines  by  the  method  of  Chomczynski  and  Sacchi  (1987). 

2.7.  RNase  mapping 

The  riboprobes  were  prepared  by  combining  1  pg  of 
template  DNA,  transcription  buffer  (200  mM  PIPES, 
2M  NaCl,  5mM  EDTA),  10  mM  DTT,  RNasin 
(40  units),  500  mM  ATP,  CTP,  GTP,  12mM  UTP, 
50  pCi  a-^^P-UTP  and  T7  RNA  polymerase  (69  units) 
(Pharmacia)  and  then  incubating  for  1  h  at  37°C.  After 
1  h,  500  mM  UTP  was  added  and  incubated  for  a  further 
5  min.  Next,  the  riboprobes  were  treated  with  RNase- 
free  DNase  at  37°C  for  1 5  min  and  then  extracted  with 
chloropane  and  run  through  a  Sephadex  G50  spun 
column.  Forty  micrograms  of  total  RNA  were  annealed 
to  8  X  10*  cpm  of  labeled  riboprobe  at  54°C  for  16  h  in 
80%  formamide-0.4  M  NaCl-0.4  M  piperazine-Af,N- 
bis  (2-ethanesulfonic  acid)  (PIPES)  (pH  6.4)-l  mM 
EDTA.  RNA-RNA  hybrids  were  digested  with  30  U  of 
RNase  T2  (Gibco)  per  ml  at  30°C  for  1  h.  After  digestion 
hybrids  were  precipitated  with  20  pg  of  tRNA,  295  pi 
of  4  M  guanidine  thiocyanate  and  590  pi  of  isopropanol. 
Pellets  were  resuspended  in  80%  formamide,  1  x  TBE 
and  0. 1  %  XC  -f  BPB,  denatured  and  electrophoresed  on 
4%  acrylamide-8  M  urea  gel.  Gels  were  dried  and 
exposed  to  X-ray  film  with  intensifying  screens  at  —  80°C 
for  the  appropriate  period  of  time. 

2.8.  Reverse  transcriptase-polymerase  chain  reaction 
(RT-PCR) 

RT-PCR  was  performed  on  Human  Multiple  Tissue 
cDNA  (MTC®)  of  normalized,  first-strand  cDNA  prep¬ 
arations  derived  from  different  adult  human  tissues, 
purchased  directly  from  Clontech  (Palo  Alto,  CA). 
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Fig.  1.  Structure  of  the  CUTLl  gene,  mRNAs  and  proteins,  (A)  Exon/intron  structure  of  the  CUTLl  gene.  Vertical  lines  represent  individual 
exons.  Three  polymorphic  markers  previously  used  in  LOH  studies  are  shown  above  the  map  and  in  boxes.  Various  genomic  clones  that  were  used 
to  determine  the  exon/intron  structure  of  the  gene  are  shown  under  the  map.  The  distances  between  exons  lA  (CASP-specific)  and  IB  (CDP/Cut- 
specific),  and  between  IB  and  2,  are  not  yet  known.  Similarly,  the  sizes  of  introns  2  and  8  are  not  known,  but  must  be  at  least  84  and  38  kb, 
respectively.  (B)  Full-length  CDP/Cut  mRNA  composed  of  24  exons.  Note  that  exon  15  has  two  splice  acceptor  sites  at  nt  1266  or  1572,  and  that 
exon  16  may  or  may  not  be  included,  as  shown  in  (D).  The  exons  are  aligned  with  the  evolutionarily  conserved  domains  of  the  protein,  as  shown 
in  (C).  (C)  Full-length  CDP/Cut  protein.  Black  rectangles  represent  the  five  evolutionarily  conserved  regions  of  the  proteins:  coiled-coil  (CC),  cut 
repeats  1  (CRl),  2  (CR2)  and  3  (CR3)  and  the  homeodomain  (HD).  Two  active  repression  domains  (R)  are  shown  as  hatched  boxes  as  previously 
defined  (Mailly  et  al.,  1996).  (D)  Alternative  splicing  events  and  their  products.  Seven  alternative  splicing  events  are  shown  as  dashed  lines.  Note 
that  splicing  event  #1  is  always  followed  by  splicing  event  #5,  while  #2  is  linked  to  #3  or  #4.  Three  and  two  alternative  splice  acceptor  sites  can  be 
used  downstream  of  exon  14  (3,  4  and  5)  and  15  (6  and  7),  respectively.  The  five  alternatively  spliced  mRNAs  are  shown  below. 


MTC  cDNA  ranges  in  size  from  0. 1  to  at  least  6  kb  and 
is  virtually  free  of  genomic  DNA  (Supplier  information). 
The  cDNA  preparations  within  each  MTC  Panel  and 
between  human  panels  have  been  carefully  normalized 


to  permit  comparisons  of  relative  abundance  of  specific 
transcripts  within  panels  and  between  human  panels. 
cDNA  was  amplified  by  oligos  flanking  the  two  deletions 
at  nt  1266-1570  and  nt  1938-2003  (HS  CDP/Cut, 
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GenBank,  Accession  No.  M74099).  The  sequences  of 
the  oligos  are:  FI  (nt  1186-1207):  GGAGGT- 
GCTGTTGCTGGAGAAG  and  B1  (nt  2146-2165): 
AGCGGATGGGCTCATCAGAG.  The  amplified  frag¬ 
ments  are  981  bp  for  full  length,  676  bp  for  deletion  of 
nt  1266-1570  (deletion  5'  of  CRl),  915  bp  for  deletion 
of  1938-2003,  and  610  bp  for  deletion  of  both.  The 
plasmid  PMX/Cut  eontaining  full-length  CDP/Cut 
served  as  a  positive  eontrol,  and  the  negative  eontrol 
had  no  DNA  template.  The  specificity  of  the  PCR 
product  was  confirmed  by  BamYil  digestion  to  produce 
the  expected  sizes  of  fragments  as  well  as  Southern  blot 
hybridization  (not  shown).  PCR  was  performed  in  a 
final  volume  of  50  pi,  containing  1  ng  of  cDNA,  1.5  mM 
MgCl2,  5  pi  of  standard  10  x  PCR  buffer  (200  mM 
Tris-HCl,  pH  8.4,  500  mM  KCl),  0.45  pm  of  each 
primer,  0.12  mM  dNTPs,  5%  DMSO,  1  pCi  [a-^^P] 
dCTP,  and  1  unit  of  Taq  polymerase  (Gibco  BRL, 
Burlington,  Ontario).  An  initial  step  of  4  min  at  95°C 
was  followed  by  30  cycles  of  1  min  of  denaturation  at 


95°C,  30  s  of  annealing  at  59°C,  and  2  min  of  extension 
at  72°C,  followed  by  a  final  extension  step  of  7  min  at 
72°C.  Pilot  tests  have  been  carried  out  to  make  sure  the 
PCR  reaction  does  not  reach  its  plateau  (not  shown). 
PCR  products  were  separated  on  a  4%  non-denaturing 
polyacrylamide  gel  and  visualized  by  autoradiography. 


3.  Results 

3.1.  CUTLl  exonfintron  structure 

One  of  our  long  term  goals  is  to  determine,  in  tumors 
with  CUTLl  LOH,  whether  the  remaining  allele  is 
mutated.  For  most  tumors,  it  will  not  be  possible  to 
obtain  RNA  that  would  serve  to  clone  CUTLl  cDNAs. 
Thus,  the  search  for  mutation  will  have  to  be  performed 
with  genomic  DNA.  For  this  purpose,  it  is  necessary 
first  to  determine  the  CUTLl  exon/intron  structure. 
Including  the  sequence  at  the  5'  and  3'  end  of  introns. 


Table  1 

Nucleotide  sequence  of  exon/intron  junctions  of  the  CUTLl  gene* 


CUTLl  exon  no. 

5'  end 

3'  end 

5'  end  sequence  of  exon 

y  end  sequence  of  exon 

Intron  size  (bp) 

IB 

1 

73 

Multiple  5'  ends 

CAGGTTGAAGgtgagcggcgtgtgg 

2 

74 

184 

ttccccaacagAGAGAACTCG 

CACTCCAGAGgtgaggcgcgt 

>84334 

3 

185 

232 

cgcctcctgctccagGATTTGCGCA 

CCAAGGAGAGgtaagciiltclatt 

41  309 

4 

233 

311 

ttccttccctttcagATTGATGCAC 

GACGTCCCAGgtaagccccggcagi 

27029 

5 

312 

449 

tItcctgtigIgcagATCCCGTACC 

AAAAATCAAGgtIggIggaaaatgc 

mi 

6 

450 

573 

ttttcttttctgcagAGG'TTACGAT 

AAAAGGAGAGgtgagcalgacItcc 

7371 

7 

574 

650 

cctcttcctttgcagAAAGCTGCAG 

CTACAAACAGgtIllgatacIctcc 

3430 

8 

651 

717 

ttctrtgcctttcagCCCTGGAAAA 

CTACTGCAAAgtaagtclclctgcl 

>38790 

9 

718 

766 

ctctcaccctcctagGGCCGACGAG 

GGCAAACCAGgtaggaccctggacg 

11610 

10 

767  . 

871 

tttcatftcctfcagAGGGCAGAGG 

ACCAGACGTGgtgggtagccccggc 

7971 

11 

872 

1060 

tittctcctccccagGAGCAGGCCA 

CACACTCAAAgtaagggggcIgcgg 

11  155 

12 

1061 

1119 

ttcttctcitttcagCAACTGGAAG 

AAGAGCTGAAgtaagtacggagagc 

3969 

13 

1120 

1168 

gttgtgctcftgcagCATTCTGAAG 

TGGGACACAGglacgtgtctcacct 

1614 

14 

1169 

1265 

ctctgccccttctagGAiTGCGGCCA 

GACCTGAGCGgtagglIggccgggc 

1051 

15 

1266 

1937 

fgcccttccftgtagGGTCAGCCAG 

AGACAAAGAGgtgagagactggcgt 

1495 

16 

1938 

2003 

cttgtttttcftcag  AGAATCCAGG 

GGGTCTGAAGgtatgtlgcaggcag 

1202 

17 

2004 

2105 

oacttctccccacagGTAACATCAC 

CAGAAAACTGgtacagcttccattt 

1186 

18 

2106 

2950 

tgtttctccatgcagCAGAGCCGGC 

CGGGGAGAAGgtaagggatctgcIc 

2185 

19 

2951 

3116 

tcggtgccactccagGTGCTGGGCC 

CAAGGGCCAGgtaatgggggIcctg 

556 

20 

3117 

3173 

ctttttctacfttagTCCXCCACTC 

GTGAGCTCAGgtaagcagcagtttc 

22184 

21 

3174 

3476 

gtgctttaattacag  AAAGCACTCC 

AACAACCTCGgtaggttctcctctc 

6374 

22 

3477 

3665 

ctgttUctctcta^CX^AGCGCTT 

GAGAAGAAAGgtaagtctccctgcc 

5077 

23 

3666 

3930 

tctcttcctccgcagCXTTACATGAA 

ACAACTACAGgtacgacggctggca 

8826 

24 

3931 

5376 

cttgttgtcttgtagGTCTCGGATC 

TTACTCCACAtattttttaacaaaa 

24006 

25 

1275 

1402 

ttcctttggccacagGACGCTGTGC 

CGATGCCGAGgtgagcccacccccc 

752 

26 

1403 

1469 

tcctgtcacctgcagGGTGCCGCTG 

CVATTCTACGglaaggagaggcclg 

937 

27 

1470 

1582 

tcctctaccIgetagGACCTGCAGC 

GCTTGAGGCCgtgagtcacatccti 

2591 

28 

1583 

1699 

cccctccccccccagGAGAACCGCC 

CCCTGGCCGGgtgagggccctcccc 

2061 

29 

1700 

1783 

ctcctgtctgtgcagGGCAGCCGGC 

CAGCAAGCGGgttcgtgagcccagc 

682 

30 

1784 

1840 

tccctccctgtgcagGAGCGGCAGA 

CCTCAGCATGgfgagfccclgcccc 

970 

31 

1841 

1922 

cctccttgctcccagGGGCGTCTGG 

GGTCTTCCTGgtgaglgfgcacacg 

792 

32 

1923 

1986 

tctccacctggccagGTGCTCTACA 

GCGCCAAGAAgtgaggacccccact 

245 

33 

1987 

2855 

tcccgtgtcccccagGTTCGCTGAC 

CACVGGCAGGccttcctgagltaga 

•Nucleotide  numbers  at  the  5'  and  3'  ends  of  exons  IB  to  24  correspond  to  that  of  the  CUTLl  cDNA  sequence  (GenBank  Accession  No. 
M74099).  Nucleotide  numbers  at  the  5'  and  3'  ends  of  exons  25  to  33  correspond  to  that  of  the  CDP/Cut  cDNA  sequence  (GenBank  Accession 
No.  M74099).  Intron  and  exon  nucleotide  sequences  are  shown  in  lower-case  and  upper-case  letters,  respectively. 
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The  sequence  information  obtained  in  this  manner  will 
then  serve  to  design  oligonucleotides  that  can  be  used 
as  primers  to  PCR-amplify  individual  exons  as  well  as 
some  of  the  flanking  intronic  sequences. 

We  have  now  determined  the  complete  exon/intron 
structure  of  the  CUTLl  gene  (Fig.  lA).  Starting  from 
a  panel  of  cosmid  clones  encompassing  the  entirety  of 
the  gene,  we  performed  DNA  sequencing  using  oligonu¬ 
cleotides  derived  from  the  cDNA  sequence  as  primers. 
In  addition,  DNA  sequencing  of  the  long  arm  of  chro¬ 
mosome  7,  as  part  of  the  human  genome  project,  has 
provided  the  DNA  sequence  of  almost  the  entirety  of 
the  CUTLl  gene,  with  only  four  gaps  (see  Fig.  lA, 
genomic  DNA  clones).  The  chromosome  7  DNA 
sequence  was  analyzed  periodically  to  identify  stretches 
that  would  contain  CUTLl  cDNA  sequence.  In  this 
manner,  we  have  confirmed  some  of  the  results  obtained 
through  our  own  DNA  sequencing  project  and  found 
some  exon/intron  junctions  that  had  not  previously  been 
identified.  The  exon  structure  of  the  full-length  CDP/Cut 
mRNA  is  shown  in  Fig.  IB  and  is  aligned  with  the 
functional  domains  of  the  protein  (Fig.  1C).  The  DNA 
sequence  at  the  exon/intron  junction  is  presented  in 
Table  1.  Alternative  splicing  events  were  also  detected 
by  the  cloning  of  various  cDNAs  (Fig.  ID)  and  were 
confirmed  by  RT-PCR  and  RNase  mapping  analyses 
(Figs.  3  and  4;  see  Sections  3.5  and  3.6).  The  structure 
of  the  alternatively  spliced  mRNAs  of  the  CUTLl  gene 
is  shown  diagrammatically  (Fig.  ID).  All  transcripts 
include  exons  2  to  14,  but  differ  5'  and  3'  of  this  common 
core.  There  are  two  alternative  exons  1,  1 A  and  IB,  and 
two  alternative  last  exons,  24  and  33.  Splicing  down¬ 
stream  of  intron  14  can  take  place  at  three  different 
positions:  at  the  start  of  exon  15  (nt  1266  in  HS  CDP), 
within  exon  1 5  (nt  1 570  in  HS  CDP)  and  at  the  start  of 
exon  25  (nt  1275  in  HS  CASP).  Moreover,  exon  16  can 
be  spliced  out  in  those  transcripts  that  contain  exon  1 5. 
Thus,  transcripts  that  end  with  exon  33  contain  exons 
25  to  33  downstream  of  exon  14,  whereas  transcripts 
that  end  with  exon  24  contain  exons  1 5  to  24  (with  the 
exception  of  exon  16  which  may  or  may  not  be 
included).  As  a  result  of  these  alternative  initiation, 
splicing  and  polyadenylation  events,  up  to  five  distinct 
transcripts  can  be  expressed  (Fig.  ID).  The  process, 
splicing  or  polyadenylation,  that  dictates  the  course  of 
events  will  be  discussed  in  Section  4.3. 

3.2.  Mapping  of  polymorphic  markers  relative  to  CUTLl 

We  had  previously  identified  three  polymorphic  mark¬ 
ers  within,  or  close  to,  the  CUTLl  gene;  D7S515, 
D7S666  and  D7S518  (Zeng  et  al.,  1997).  The  cloning 
of  CUTLl  genomic  DNA  allowed  us  to  precisely  map 
these  markers  by  Southern  blot  analysis  (data  not 
shown).  The  positions  of  these  three  markers  were 
confirmed  by  sequence  analysis  of  genomic  clones 


PAC50h2  and  cos  123E15  (see  Fig.  lA).  D7S515  is 
located  within  intron  3,  D7S666  in  intron  6  and  D7S518 
in  intron  20.  Thus,  all  three  markers  are  positioned 
within  the  CUTLl  gene. 


3.3.  Mapping  of  CUTLl  transcription  start  sites 

A  genomic  DNA  fragment  corresponding  to  the  5' 
end  of  the  CUTLl  cDNA  was  subcloned  into  the 
pCR2.1  vector.  No  TATA  box  was  found  within  approx¬ 
imately  1  kb  of  DNA  sequence.  A  riboprobe  was  made 
and  used  in  RNase  mapping  analysis  to  map  the  5'  end 
of  CDP/Cut  transcripts  and,  by  inference,  the  transcrip¬ 
tion  start  site(s)  of  the  gene  (Fig.  2A).  Several  protected 
fragments  were  observed,  suggesting  that  transcription 
starts  at  various  positions,  a  finding  that  is  consistent 
with  the  absence  of  a  TATA  box  in  this  region.  The 
positions  of  transcription  start  sites  are  shown  on  the 
sequence  (Fig.  2B).  Importantly,  many  mRNAs  appear 
to  be  initiated  upstream  of  the  position  predicted  from 
the  published  CDP/Cut  cDNA  sequence  (Fig.  2B; 
CDP/Cut  cDNA:  GenBank  Accession  No.  M74099). 

3.4.  Mapping  of  potential  CUTLl  translation  start  sites 

The  translation  start  site  in  the  CDP/Cut  cDNA 
sequence  has  been  arbitrarily  positioned  at  nt  44,  where 
the  first  methionine  codon  is  found  (position  550  in  the 
sequence  of  Fig.  2B).  However,  there  is  no  termination 
codon  preceding  it  in  the  same  reading  frame. 
Interestingly,  when  the  upstream  genomic  sequence  is 
abutted  to  the  known  sequence  of  CDP/Cut,  the  open 
reading  can  be  extended  towards  the  5'  end,  and  addi¬ 
tional  methionine  codons  can  be  found  (positions  250 
and  337,  Fig.  2B).  This  finding  raised  the  possibility 
that  mRNAs  that  are  longer  at  their  5'  end  could  encode 
proteins  with  extended  amino-terminal  domains.  To 
investigate  this,  we  have  carefully  mapped  the  multiple 
transcription  start  sites  and  positioned  them  on  the 
sequence  shown  in  Fig.  2B.  We  found  that  most  mRNAs 
were  initiated  downstream  of  the  ATG  codon  at  position 
337  and  therefore  did  not  include  those  upstream  ATG 
codons.  However,  one  major  transcript  was  initiated  at 
position  334  in  HeLa  and  293  cells.  It  is  thus  possible 
that  this  mRNA  encodes  for  a  protein  whose  translation 
would  start  at  nt  337.  Importantly,  in  RNase  mapping, 
no  protected  fragment  was  observed  at  the  position 
expected  for  a  transcript  that  would  initiate  upstream 
of  the  Ncol  site.  Thus,  we  conclude  that  in  no  case  can 
translation  start  at  this  Ncol  site,  and  for  the  vast 
majority  of  transcripts,  translation  must  initiate  at  the 
ATG  codon  previously  identified  (position  550  in 
Fig.  2B;  nt  44  in  HS  CDP  sequenee,  M74099). 
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Start  of  HS  CDP  cDN  A  sequence 


Fig.  2.  RNase  mapping  analysis  of  CUTLl  mRNA  5'  ends.  (A)  Genomic  fragment  corresponding  to  the  5'  end  of  the  CDP/Cut  cDNA  subcloned 
into  the  pCR2.1  vector  (InVitrogen).  A  riboprobe  was  prepared  and  used  in  RNase  mapping  analysis  using  30  pg  of  total  RNA  from  human 
placenta  as  well  as  Hel,  293  and  HeLa  cell  lines.  The  same  amount  of  tRNA  was  used  as  a  control.  The  undigested  riboprobe  as  well  as  the  major 
protected  fragments  are  indicated  by  arrows.  Since  the  molecular  weight  markers  were  DNA  fragments,  the  calculated  sizes  of  protected  fragments 
are  approximate.  (B)  Genomic  DNA  sequence  at  the  5'  end  of  exon  IB  of  CDP/Cut.  The  published  sequence  of  CDP/Cut  starts  at  nt  513,  and 
exon  IB  sequence  ends  at  nt  579.  The  amino  acid  sequence  is  shown  under  the  nucleotide  sequence.  In  frame  with  the  long  open  reading  frame 
are  three  ATG  codons  at  positions  250,  337  and  550.  The  underlined  region  corresponds  to  the  suspected  amino-terminus  of  CDP/Cut.  The 
approximate  positions  of  transcripts  5'  ends  are  indicated  by  the  arrows.  Note  that  only  one  transcript,  present  in  low  abundance  in  293,  would 
start  upstream  of  the  second  ATG  codon  at  337. 


3.5.  Ubiquitous  expression  of  the  alternatively  spliced 
CDP  I  Cut  transcripts 

We  have  previously  obtained  several  CDP/Cut 
cDNAs  with  internal  deletions  at  either  of  two  positions. 
Some  cDNAs  lacked  sequences  from  nt  1266  to  1569, 
at  the  beginning  of  exon  15.  Others  missed  the  entire 


exon  16,  from  nt  1938  to  2105,  as  previously  reported 
(Neufeld  et  al.,  1992).  To  confirm  whether  alternatively 
spliced  transcripts  are  expressed  and  to  verify  whether 
expression  of  these  transcripts  follows  a  tissue-specific 
pattern,  we  performed  an  RT~PCR  analysis  using  total 
RNAs  from  various  human  tissues.  Two  oligonucleo¬ 
tides,  located  in  exons  14  and  18,  respectively,  were 
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Fig.  3.  Expression  of  CDP/Cut  alternative  mRNAs  in  various  tissues  CDP/Cut  alternative  mRNAs  were  analyzed  by  reverse  transcriptase  polymerase 
chain  amplification  (RT-PCR)  using  Human  Multiple  Tissue  cDNA  (MTC®)  of  normalized,  first-strand  cDNA  preparations  derived  from 
different  adult  human  tissues  (Clontech).  Using  oligonucleotides  corresponding  to  nt  1186-1207  and  nt  2146-2165  as  primers,  the  amplified 
fragments  are  981  bp  for  full  length,  676  bp  for  deletion  of  nt  1266-1570  (deletion  5'  of  CRl),  915  bp  for  deletion  of  1938-2003,  and  610  bp  for 
deletion  of  both. 


designed  to  amplify  the  region  of  alternative  splicing 
(Fig.  3).  Four  products,  corresponding  to  all  possible 
isoforms,  were  observed  in  most  tissues.  The  full-length 
isoform  was  consistently  the  most  abundant  species,  but 
collectively,  the  variant  species  represented  a  substantial 
fraction  of  all  transcripts.  The  smallest  isoform,  lacking 
nt  1266-1569  and  1938-2105,  was  the  least  abundant 
but  could  be  visualized  in  all  tissues  following  a  longer 
exposure  (data  not  shown).  Overall,  the  relative  abun¬ 
dance  of  all  isoforms  varied  little  from  one  tissue  to 
another.  Similar  results  were  obtained  using  a  panel  of 
cell  lines  from  diverse  origins.  We  conclude  that  all 
isoforms  are  expressed  in  all  tissues. 

3.6.  CDPjCut  and  CASP  transcripts  diverge  downstream 
of  exon  14 

As  shown  in  Fig.  ID,  splicing  downstream  of  exon 
14  can  involve  three  alternative  acceptor  sites,  one  of 
which,  generating  the  CASP  transcript,  is  located 
approximately  76  kb  downstream  (Lievens  et  al.,  1997). 


To  confirm  the  existence  of  this  transcript,  and  to 
compare  its  expression  relative  to  the  others,  we  per¬ 
formed  an  RNase  mapping  analysis  using  riboprobes 
derived  from  the  human  CASP  cDNA.  The  advantage 
of  a  CASP  riboprobe  in  this  experiment  is  that,  in 
addition  to  the  CASP-specific  protected  fragment,  only 
one  additional  fragment  should  be  generated  by  all 
CDP/Cut  mRNA  isoforms.  As  expected,  using  the 
human  CASP-specific  riboprobe,  two  protected  frag¬ 
ments  were  detected  in  human  placenta  (Fig.  4).  The 
ratio  between  these  two  fragments  did  not  vary  in 
diverse  human  cell  lines  (data  not  shown). 


4.  Discussion 

4.1.  Polymorphic  markers  within  CUTLl 

Analysis  of  the  organization  of  CUTLl  indicates  that 
it  spans  a  minimum  of  340  kb  and  comprises  33  exons 
ranging  in  size  from  47  to  1445  bp.  Three  polymorphic 
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Fig.  4.  RNase  mapping  analysis  of  CASP  and  CDP/Cut  in  human  placenta.  (Left)  Total  RNA  (40  pg)  from  human  placenta  was  subjected  to 
RNase  mapping  analysis  using  a  riboprobe  derived  from  the  CASPcDNA  and  encompassing  sequences  shared  by  the  CDP/Cut  and  CASP  mRNAs. 
Since  the  molecular  weight  markers  were  DNA  fragments,  the  calculated  sizes  of  protected  fragments  are  approximate.  (Right)  Schematic  representa¬ 
tion  of  the  human  CDP/Cut  and  CASP  cDNAs  are  displayed  from  exon  1  to  15  and  exon  1  to  25,  respectively.  The  probes  and  protected  fragments 
are  depicted  below.  The  human  CASP  riboprobe  spanned  nucleotides  1020  to  1316  of  the  HSCASP  sequence.  Note  that  the  fragment  protected 
by  the  CASP  mRNA  is  larger,  since  the  riboprobe  is  derived  from  its  cDNA. 


markers  previously  found  to  be  in  close  proximity  to 
the  CUTLl  gene,  D7S515,  D7S666  and  D7S518,  were 
mapped  to  introns  3,  6  and  20,  respectively.  Some  of 
these  markers  have  been  shown  in  other  studies  to  be 
included  in  the  smallest  commonly  deleted  region  in 
some  uterine  leiomyomas  and  breast  tumors  (Zeng  et  al., 
1997, 1999).  Interestingly,  among  12  breast  tumors  with 
a  loss  of  heterozygosity  (LOH)  of  7q22,  five  tumors 
exhibited  LOH  of  D7S515  while  retaining  D7S518  (Zeng 
et  al.,  1999).  In  one  of  these  tumors,  the  marker  D7S666 
within  intron  6  was  also  retained.  Thus,  the  centromere- 
proximal  border  of  the  smallest  commonly  deleted 
region  can  be  precisely  mapped  to  the  region  in 
between  introns  6  and  20,  further  implicating  CUTLl 
as  a  candidate  tumor  suppressor  gene.  It  should  be 
stressed,  however,  that  the  large  size  of  some  introns 
within  CUTLl  makes  it  possible  that  they  contain  other 
genes.  Indeed,  analysis  of  their  DNA  sequence  has 
revealed  several  open  reading  frames  (Glockner  et  al., 
1998). 

4.2.  Transcription  and  translation  start  sites 

We  have  cloned  the  genomic  fragment  containing 
exon  IB  and  generated  a  riboprobe  to  perform  RNase 
mapping  with  total  RNA  from  various  sources.  Multiple 
protected  fragments  were  observed.  Two  conclusions 
can  be  drawn  from  this  result.  First,  exon  IB  is  truly 
the  first  exon  for  some  of  the  CDP/Cut  mRNAs.  If 
exon  IB  was  in  fact  the  second  exon,  we  would 


have  observed  only  one  protected  fragment  whose 
boundary  would  correspond  to  the  junction  between 
intron  1  and  exon  2.  Secondly,  transcription  upstream 
of  exon  IB  is  initiated  at  multiple  sites  over  a  region  of 
approximately  200  bp.  This  finding  is  consistent  with 
the  fact  that  this  region  does  not  include  a  TATA  box 
and  is  GC-rich. 

The  sequence  in  the  5'  region  of  the  CDP/Cut  cDNA 
does  not  include  an  in-frame  termination  codon 
upstream  of  the  putative  ATG  initiation  codon.  As  a 
result,  the  open  reading  frame  can  be  extended  to  the  5' 
end  of  the  cDNA.  This  raised  the  possibility  that  the 
cDNA  was  not  full  length  and  that  another  translation- 
initiation  site  might  exist  further  upstream.  Indeed, 
analysis  of  the  genomic  DNA  sequence  at  the  5'  end  of 
the  gene  indicated  that  not  only  could  the  open  reading 
frame  be  extended  in  the  5'  direction,  but  also  two 
additional  in  frame  ATG  codons  were  present.  It  should 
be  stressed  that  the  riboprobe  used  in  the  RNase  protec¬ 
tion  assay  was  designed  in  such  a  way  as  to  easily  detect 
any  transcript  that  would  protect  the  entire  genomic 
sequence  of  the  probe  (see  Fig.  2A).  Since  no  such 
transcript  was  detected,  we  can  be  fully  confident  that 
no  mRNA  is  initiated  upstream  of  this  region. 
Moreover,  the  vast  majority  of  CDP/Cut  mRNAs  were 
found  to  start  downstream  of  these  additional  ATG 
codons.  In  light  of  these  results,  we  are  compelled  to 
conclude  that  the  ATG  codon  at  position  44  of  the 
CDP/Cut  cDNA  sequence  (nt  550  in  Fig.  2B)  is  most 
likely  the  correct  translation  initiation  site. 
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4.3.  Alternative  transcription  initiation,  splicing  and 
polyadenylation 

In  the  course  of  this  study,  several  cDNA  clones  were 
isolated  whose  DNA  sequence  suggested  the  existence 
of  alternative  splicing  events.  The  production  of  five 
alternatively  spliced  mRNAs  was  confirmed  by  RT- 
PCR  analysis  (Fig.  3)  and  RNase  mapping  analysis 
(Fig.  4).  Interestingly,  most  of  the  alternative  splicing 
events  are  concentrated  in  the  same  region  of  the  gene, 
downstream  of  exons  14  and  15.  Splicing  downstream 
of  exon  1 5  can  lead  to  linkage  with  exon  1 6  or  17,  while 
splicing  downstream  of  exon  14  can  involve  three 
different  acceptor  sites,  two  at  the  beginning  or  in  the 
middle  of  exon  15,  and  one  at  the  5'  end  of  exon  25, 
approximately  76  kb  downstream.  This  last  event  is 
responsible  for  the  production  of  CASP  mRNAs  which 
lack  exons  15  to  24  but  include  exons  25  to  33.  In 
contrast,  mRNAs  encoding  for  CDP/Cut  proteins  are 
spliced  from  exon  14  to  15  and  are  cleaved  at  the  end 
of  exon  24.  At  first  sight,  it  appears  that  there  might  be 
competition  between  several  splice  acceptor  sites  down¬ 
stream  of  exon  14.  However,  we  favor  the  view  that  the 
primary  decision  entails  the  3'  end  cleavage  and  polyade¬ 
nylation  at  the  end  of  exon  24.  Indeed,  since  exon  25  is 
located  approximately  24  kb  downstream  of  exon  24, 
there  should  be  a  sufficient  amount  of  time  to  cleave 
the  primary  transcript  at  the  end  of  exon  24  while  RNA 
pol  II  is  progressing  through  that  region.  Cleavage  at 
this  position,  in  turn,  would  signal  the  RNA  polymerase 
to  terminate  transcription  before  it  has  reached  exon 
25.  This  would  effectively  preclude  the  possibility  of 
exon  14  being  spliced  to  exon  25.  In  contrast,  synthesis 
of  the  CASP  mRNA  necessitates  that  the  primary 
transcript  not  be  cleaved  downstream  of  exon  24  and 
that  transcription  proceed  unabated  up  to,  and  beyond, 
exon  25.  In  this  respect,  we  note  that  the  polyadenylation 
signal  at  the  end  of  exon  24  is  probably  inefficient 
because  the  AAUAAA  motif  is  embedded  within  the 
sequence  AAAAUAAAA.  The  presence  of  an  excess  of 
A  residues  is  reminiscent  of  the  polyadenylation  signal 
at  the  end  of  the  late  transcription  unit  of  Polyoma  virus. 
It  was  shown  that  processing  of  primary  transcripts 
downstream  of  such  signals  is  inefficient.  As  a  result, 
transcription  was  found  to  continue  several  times  around 
the  circular  viral  genome,  generating  primary  transcripts 
with  multiple  genome  equivalents  (Acheson,  1984). 
Within  the  CUTLl  locus,  primary  transcripts  that  are 
elongated  up  to  exon  33  may  invariably  be  spliced 
between  exon  14  and  25,  but  the  possibility  of  cleavage 
downstream  of  exon  24  would  still  exist.  It  follows  that 
the  decision  to  make  CDP/Cut  or  CASP  mRNA  is 
based  on  the  competition  between  cleavage  at  the  end 
of  exon  24  and  splicing  between  exon  14  and  25  (result¬ 
ing  in  the  skipping  of  exon  24).  This  type  of  post- 
transcriptional  regulation  has  previously  been  described. 


notably  in  the  case  of  the  gene  encoding  for  calcitonin 
and  calcitonin  gene-related  peptide  [reviewed  in 
Edwalds-Gilbert  et  al.  (1997)]. 

4.4.  Differences  between  the  human  CUTLl  and  murine 
Cux  genes 

A  partial  characterization  of  the  Cux  gene,  the  murine 
homologue  of  CUTLl,  has  previously  been  reported 
(Tufarelli  et  al.,  1998).  Although  the  Cux  exons  have 
not  all  been  identified,  a  striking  difference  between  the 
two  genes  can  readily  be  observed.  While,  in  the  human 
CUTLl  gene,  CASP-specific  exons  (25-33)  are  located 
downstream  of  the  last  CDP/Cut  exon  (24),  in  the 
mouse  gene,  CASP-specific  exons  are  located  upstream 
of  the  Cut  repeat  3  encoding  exon.  Evolution  of  these 
two  genes  from  a  common  ancestor  would  require  a 
partial  duplication  of  the  gene  in  one  evolutionary 
branch,  followed  by  two  distinct  deletions.  Another 
puzzling  issue  is  the  mechanisms  of  polyadenylation  and 
splicing  that  lead  to  the  production  of  identical  proteins 
from  two  differently  organized  genes.  In  contrast  to  the 
situation  within  the  CUTLl  gene,  production  of  the  cux 
mRNA  would  require  that  the  CASP-specific  polyadeny¬ 
lation  signal  be  inefficient  in  order  to  allow  transcription 
readthrough  to  the  end  of  the  cux-specific  exons.  Future 
work  should  address  this  issue. 
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